Scheme 12

In yet a further approach, summarized in Scheme 13, a masked reactive functionality
is present on the sialic acid. The masked reactive group is preferably unaffected by the
conditions used to attach the modified sialic acid to the peptide. After the covalent
attachment of the modified sialic acid to the peptide, the mask is removed and the peptide is
conjugated with an agent such as PEG, PPG, a therapeutic moiety, biomolecule or other
agent. The agent is conjugated to the peptide in a specific manner by its reaction with the
unmasked reactive group on the modified sugar residue.

Scheme 13
Any modified sugar can be used with its appropriate glycosyltransferase, depending
on the terminal sugars of the oligosaccharide side chains of the glycopeptide (Table 3). As
discussed above, the terminal sugar of the glycopeptide required for introduction of the
PEGylated or PPGylated structure can be introduced naturally during expression or it can be
produced post expression using the appropriate glycosidase(s), glycosyltransferase(s) or mix
of glycosidase(s) and glycosyltransferase(s).
In another exemplary embodiment, a GlcNAc transferase, such as GnT I-V, is utilized
to transfer PEGylated-GlcNAc to a mannose residue on a glycopeptide. In a still further
exemplary embodiment, the N- and/or O-linked glycan structures are enzymatically removed
from a glycopeptide to expose an amino acid or a terminal glycosyl residue that is
subsequently conjugated with the modified sugar. For example, an endoglycanase is used to
remove the N-linked structures of a glycopeptide to expose a terminal GlcNAc as a
GlcNAc-linked-Asn on the glycopeptide. UDP-Gal-PEG and the appropriate galactosyltransferase
are used to introduce the PEG- or PPG-galactose functionality onto the exposed GlcNAc.

In an alternative embodiment, the modified sugar is added directly to the peptide
backbone using a glycosyltransferase known to transfer sugar residues to the peptide
backbone. This exemplary embodiment is set forth in Scheme 14. Exemplary
glycosyltransferases useful in practicing the present invention include, but are not limited to,
GalNAc transferases (GalNAc T1-14), GlcNAc transferases, fucosyltransferases,
glucosyltransferases, xylosyltransferases, mannosyltransferases and the like. Use of this
approach allows the direct addition of modified sugars onto peptides that lack any
carbohydrates or, alternatively, onto existing glycopeptides. In both cases, the addition of the
modified sugar occurs at specific positions on the peptide backbone as defined by the
substrate specificity of the glycosyltransferase and not in a random manner as occurs during
modification of a protein's peptide backbone using chemical methods. An array of agents
can be introduced into proteins or glycopeptides that lack the glycosyltransferase substrate
peptide sequence by engineering the appropriate amino acid sequence into the peptide chain.

Scheme 14 
In each of the exemplary embodiments set forth above, one or more additional
chemical or enzymatic modification steps can be utilized following the conjugation of the
Table 4: Lysosomal storage diseases and associated enzymatic defects

Disease                              Enzymatic Defect
Pompe disease                        acid α-glucosidase (acid maltase)
MPS I* (Hurler disease)              α-L-iduronidase
MPS II (Hunter disease)              iduronate sulfatase
MPS III (Sanfilippo)                 heparan N-sulfatase
MPS IV (Morquio A)                   galactose-6-sulfatase
MPS IV (Morquio B)                   acid β-galactosidase
MPS VII (Sly disease)                β-glucuronidase
I-cell disease                       N-acetylglucosamine-1-phosphotransferase
Schindler disease                    α-N-acetylgalactosaminidase (α-galactosidase B)
Wolman disease                       acid lipase
Cholesterol ester storage disease    acid lipase
Farber disease                       lysosomal acid ceramidase
Niemann-Pick disease                 acid sphingomyelinase
Gaucher disease                      glucocerebrosidase
Krabbe disease                       galactosylceramidase
Fabry disease                        α-galactosidase A
GM1 gangliosidosis                   acid β-galactosidase
Galactosialidosis                    β-galactosidase and neuraminidase
Tay-Sachs disease                    hexosaminidase A
Sandhoff disease                     hexosaminidase A and B

*MPS = mucopolysaccharidosis

De Duve first suggested that replacement of the missing lysosomal enzyme with
exogenous biologically active enzyme might be a viable approach to treatment of lysosomal
storage diseases (De Duve, Fed. Proc. 23: 1045 (1964)). Since that time, various studies have
suggested that enzyme replacement therapy may be beneficial for treating various lysosomal
storage diseases. The best success has been shown with individuals with type I Gaucher
disease, who have been treated with exogenous enzyme (β-glucocerebrosidase), prepared
from placenta (Ceredase™) or, more recently, recombinantly (Cerezyme™). It has been
suggested that enzyme replacement may also be beneficial for treating Fabry's disease, as
well as other lysosomal storage diseases. See, for example, Dawson et al., Ped. Res. 7(8):
684-690 (1973) (in vitro) and Mapes et al., Science 169: 987 (1970) (in vivo). Clinical trials
of enzyme replacement therapy have been reported for Fabry patients using infusions of
normal plasma (Mapes et al., Science 169: 987-989 (1970)), α-galactosidase A purified from
placenta (Brady et al., N. Eng. J. Med. 279: 1163 (1973)); or α-galactosidase A purified from
spleen or plasma (Desnick et al., Proc. Natl. Acad. Sci., USA 76: 5326-5330 (1979)) and have
In addition to the mannose-6-phosphate, the peptides of the invention may be further
derivatized with a moiety such as a water-soluble polymer, a therapeutic moiety, or an
additional targeting moiety. Methods for attaching these and other groups are set forth
herein. In an exemplary embodiment, the group other than mannose-6-phosphate is attached
to the peptide via a derivatized sialic acid derivative according to Table 2, in which one or
more of the "R" moieties is a group other than mannose-6-phosphate.

In an exemplary embodiment, a sialic acid moiety modified with a Cbz-protected
glycine-based linker arm is prepared. The corresponding nucleotide sugar is prepared and the
Cbz group is removed by catalytic hydrogenation. The resulting nucleotide sugar has an
available, reactive amine that is contacted with an activated mannose-6-phosphate derivative,
providing a mannose-6-phosphate derivatized nucleotide sugar that is useful in practicing the
methods of the invention.

As shown in the scheme below (Scheme 15), an exemplary activated mannose-6-phosphate
derivative is formed by converting a 2-bromo-benzyl-protected phosphotriester
into the corresponding triflate, in situ, and reacting the triflate with a linker having a reactive
oxygen-containing moiety, forming an ether linkage between the sugar and the linker. The
benzyl protecting groups are removed by catalytic hydrogenation, and the methyl ester of the
linker is hydrolyzed, providing the corresponding carboxylic acid. The carboxylic acid is
activated by any method known in the art. An exemplary activation procedure relies upon the
conversion of the carboxylic acid to the N-hydroxysuccinimide ester.






WO 03/031464



PCT/US02/32263 



Scheme 15 
In another exemplary embodiment, as shown in the scheme below (Scheme
16), an N-acetylated sialic acid is converted to an amine by manipulation of the pyruvyl
moiety. Thus, the primary hydroxyl is converted to a sulfonate ester and reacted with sodium
azide. The azide is catalytically reduced to the corresponding amine. The sugar is
moiety includes at least one bond that is degraded in vivo, releasing the therapeutic peptide
from the targeting agent, following delivery of the conjugate to the targeted tissue or region 
of the body. 

In yet another exemplary embodiment, the in vivo distribution of the therapeutic
moiety is altered via altering a glycoform on the therapeutic moiety without conjugating the
therapeutic peptide to a targeting moiety. For example, the therapeutic peptide can be
shunted away from uptake by the reticuloendothelial system by capping a terminal galactose
moiety of a glycosyl group with sialic acid (or a derivative thereof) (Figures 23 and 26).
Sialylation to cover terminal Gal avoids uptake of the peptide by hepatic asialoglycoprotein
(ASGP) receptors, and may extend the half life of the peptide as compared with peptides
having only complex glycan chains, in the absence of sialylation.

II. Peptide/Glycopeptides of the Invention

In one embodiment, the present invention provides a composition comprising multiple
copies of a single peptide having an elemental trimannosyl core as the primary glycan
structure attached thereto. In preferred embodiments, the peptide may be a therapeutic
molecule. The natural form of the peptide may comprise complex N-linked glycans or may
be a high mannose glycan. The peptide may be a mammalian peptide, and is preferably a
human peptide. In some embodiments the peptide is selected from the group consisting of an
immunoglobulin, erythropoietin, tissue-type activator peptide, and others (See Figure 1).

Exemplary peptides whose glycans can be remodeled using the methods of the
invention are set forth in Figure 1.
sugar (e.g., N-acetylgalactosamine, galactose, mannose, GlcNAc, glucose, fucose or xylose)
to a hydroxy side chain of a hydroxyamino acid, preferably serine or threonine, although
5-hydroxyproline or 5-hydroxylysine may also be used.

Several exemplary embodiments of the invention are discussed below. While several
of these embodiments use peptides having names carried by trademarks, and other specific
peptides as the exemplary peptide, these examples are not confined to any specific peptide.
The following exemplary embodiments are contemplated to include all peptide equivalents
and variants of any peptide. Such variants include, but are not limited to, adding and deleting
N-linked and O-linked glycosylation sites, and fusion proteins with added glycosylation sites.
One of skill in the art will appreciate that the following embodiments and the basic methods
disclosed therein can be applied to many peptides with equal success.

In one exemplary embodiment, the present invention provides methods for modifying
Granulocyte Colony Stimulating Factor (G-CSF). Figures 27A to 27G set forth some
examples of how this is accomplished using the methodology disclosed herein. In Figure
27B, a G-CSF peptide that is expressed in a mammalian cell system is trimmed back using a
sialidase. The residues thus exposed are modified by the addition of a sialic
acid-poly(ethylene glycol) moiety (PEG moiety), using an appropriate donor therefor and
ST3Gal1. Figure 27C sets forth an exemplary scheme for modifying a G-CSF peptide that is
expressed in an insect cell. The peptide is modified by adding a galactose moiety using an
appropriate donor thereof and a galactosyltransferase. The galactose residues are
functionalized with PEG via a sialic acid-PEG derivative, through the action of ST3Gal1. In
Figure 27D, bacterially expressed G-CSF is contacted with an N-acetylgalactosamine donor
and N-acetylgalactosamine transferase. The peptide is functionalized with PEG, using a
PEGylated sialic acid donor and a sialyltransferase. In Figure 27E, mammalian cell
expressed G-CSF is contacted with a sialic acid donor that is modified with levulinic acid,
adding a reactive ketone to the sialic acid donor. After addition to a glycosyl residue on the
glycan on the peptide, the ketone is derivatized with a moiety such as a hydrazine- or
amine-PEG. In Figure 27F, bacterially expressed G-CSF is remodeled by contacting the peptide
with an endo-GalNAc enzyme under conditions where it functions in a synthetic, rather than
a hydrolytic manner, thereby adding a PEG-Gal-GalNAc molecule from an activated
derivative thereof. Figure 27G provides another route for remodeling bacterially expressed
molecule is PEGylated using trans-sialidase and PEGylated sialic acid-lactose complex. In
Figure 28L, IFNα14C expressed in a mammalian system is sialylated using a donor of sialic
acid and a 2,8-sialyltransferase. In Figure 28M, IFNα14C expressed in insect or fungal cells
first has N-acetylglucosamine added using an appropriate donor and GnT I and/or II. The
molecule is then contacted with a galactosyltransferase and a galactose donor that is
derivatized with a reactive sialic acid via a linker, so that the polypeptide is attached to the
reactive sialic acid via the linker and the galactose residue. The polypeptide is then contacted
with ST3Gal3 and transferrin, and thus becomes connected with transferrin via the sialic acid
residue. In Figure 28N, IFNα14C expressed in either insect or fungal cells is first treated
with endoglycanase to trim back the glycosyl groups, and is then contacted with a
galactosyltransferase and a galactose donor that is derivatized with a reactive sialic acid via a
linker, so that the polypeptide is attached to the reactive sialic acid via the linker and the
galactose residue. The molecule is then contacted with ST3Gal3 and transferrin, and thus
becomes connected with transferrin via the sialic acid residue.

In another exemplary embodiment, the invention provides methods for modifying
Interferon α-2a or 2b (IFNα), as shown in Figures 28O to 28EE. In Figure 28P, IFNα
produced in mammalian cells is first treated with sialidase to trim back the glycosyl units, and
is then PEGylated using ST3Gal3 and a PEGylated sialic acid donor. In Figure 28Q, IFNα
expressed in insect cells is first galactosylated using an appropriate donor and a
galactosyltransferase, and is then PEGylated using ST3Gal1 and a PEGylated sialic acid
donor. Figure 28R offers another method for remodeling IFNα expressed in bacteria:
PEGylated N-acetylgalactosamine is added to the protein using an appropriate donor and
N-acetylgalactosamine transferase. In Figure 28S, IFNα expressed in mammalian cells is
modified by capping appropriate terminal residues with a sialic acid donor that is modified
with levulinic acid, adding a reactive ketone to the sialic acid donor. After addition to a
glycosyl residue of the peptide, the ketone is derivatized with a moiety such as a hydrazine-
or amine-PEG. In Figure 28T, IFNα expressed in bacteria is PEGylated using a modified
enzyme Endo-N-acetylgalactosamidase, which functions in a synthetic instead of a hydrolytic
manner, and using a N-acetylgalactosamine donor derivatized with a PEG moiety. In Figure
28U, N-acetylgalactosamine is first added to IFNα using an appropriate donor and
N-acetylgalactosamine transferase, and then is PEGylated using a sialyltransferase and a
29C is a scheme for modifying IFN-β produced by insect cells. First, N-acetylglucosamine is
added to IFN-β using an appropriate donor and GnT-I and/or -II. The protein is then
galactosylated using a galactose donor and a galactosyltransferase. Finally, IFN-β is
PEGylated using ST3Gal3 and a donor of PEG-sialic acid. In Figure 29D, IFN-β expressed
in yeast is first treated with Endo-H to trim back its glycosyl chains, and is then
galactosylated using a galactose donor and a galactosyltransferase, and is then PEGylated
using ST3Gal3 and a donor of PEGylated sialic acid. In Figure 29E, IFN-β produced by
mammalian cells is modified by PEGylation using ST3Gal3 and a donor of sialic acid already
derivatized with a PEG moiety. In Figure 29F, IFN-β expressed in insect cells first has
N-acetylglucosamine added by one or more of GnT I, II, IV, and V using a
N-acetylglucosamine donor, and then is galactosylated using a galactose donor and a
galactosyltransferase, and is then PEGylated using ST3Gal3 and a donor of PEG-sialic acid.
In Figure 29G, IFN-β expressed in yeast is first treated with mannosidases to trim back the
mannosyl units, then has N-acetylglucosamine added using a N-acetylglucosamine donor and
one or more of GnT I, II, IV, and V. The protein is further galactosylated using a galactose
donor and a galactosyltransferase, and then PEGylated using ST3Gal3 and a PEG-sialic acid
donor. In Figure 29H, mammalian cell expressed IFN-β is modified by capping appropriate
terminal residues with a sialic acid donor that is modified with levulinic acid, adding a
reactive ketone to the sialic acid donor. After addition to a glycosyl residue of the peptide,
the ketone is derivatized with a moiety such as a hydrazine- or amine-PEG. In Figure 29I,
IFN-β expressed in a mammalian system is PEGylated using a donor of PEG-sialic acid and a
2,8-sialyltransferase. In Figure 29J, IFN-β expressed by mammalian cells is first treated with
sialidase to trim back its terminal sialic acid residues, and then PEGylated using
trans-sialidase and a donor of PEGylated sialic acid. In Figure 29K, IFN-β expressed in
mammalian cells is first treated with sialidase to trim back terminal sialic acid residues, then
PEGylated using ST3Gal3 and a donor of PEG-sialic acid, and then sialylated using ST3Gal3
and a sialic acid donor. In Figure 29L, IFN-β expressed in mammalian cells is first treated
with sialidase and galactosidase to trim back the glycosyl chains, then galactosylated using a
galactose donor and an α-galactosyltransferase, and then PEGylated using ST3Gal3 or a
sialyltransferase and a donor of PEG-sialic acid. In Figure 29M, IFN-β expressed in
mammalian cells is first treated with sialidase to trim back the glycosyl units. It is then
Factor IX produced by mammalian cells is first treated with sialidase to trim back the
terminal sialic acid residues, and is then PEGylated with ST3Gal3 using a PEG-sialic acid
donor. In Figure 31C, Factor IX expressed by mammalian cells is first treated with sialidase
to trim back the terminal sialic acid residues, is then PEGylated using ST3Gal3 and a
PEG-sialic acid donor, and is further sialylated using ST3Gal1 and a sialic acid donor. Another
scheme for remodeling mammalian cell produced Factor IX can be found in Figure 31D. The
polypeptide is first treated with sialidase to trim back the terminal sialic acid residues, then
galactosylated using a galactose donor and a galactosyltransferase, further sialylated using a
sialic acid donor and ST3Gal3, and then PEGylated using a donor of PEGylated sialic acid
and ST3Gal1. In Figure 31E, Factor IX that is expressed in a mammalian system is
PEGylated through the process of sialylation catalyzed by ST3Gal3 using a donor of
PEG-sialic acid. In Figure 31F, Factor IX expressed in mammalian cells is modified by capping
appropriate terminal residues with a sialic acid donor that is modified with levulinic acid,
adding a reactive ketone to the sialic acid donor. After addition to a glycosyl residue of the
peptide, the ketone is derivatized with a moiety such as a hydrazine- or amine-PEG. Figure
31G provides an additional method of modifying Factor IX. The polypeptide, produced by
mammalian cells, is PEGylated using a donor of PEG-sialic acid and a 2,8-sialyltransferase.

In another exemplary embodiment, the invention provides methods for modification
of Follicle Stimulating Hormone (FSH). Figures 32A to 32J present some examples. In
Figure 32B, FSH is expressed in a mammalian system and modified by treatment with sialidase
to trim back terminal sialic acid residues, followed by PEGylation using ST3Gal3 and a
donor of PEG-sialic acid. In Figure 32C, FSH expressed in mammalian cells is first treated
with sialidase to trim back terminal sialic acid residues, then PEGylated using ST3Gal3 and a
donor of PEG-sialic acid, and then sialylated using ST3Gal3 and a sialic acid donor. Figure
32D provides a scheme for modifying FSH expressed in a mammalian system. The
polypeptide is treated with sialidase and galactosidase to trim back its sialic acid and
galactose residues, then galactosylated using a galactose donor and a galactosyltransferase,
and then PEGylated using ST3Gal3 and a donor of PEG-sialic acid. In Figure 32E, FSH
expressed in mammalian cells is modified in the following procedure: FSH is first treated
with sialidase to trim back the sialic acid residues, then PEGylated using ST3Gal3 and a
donor of PEG-sialic acid, and is then sialylated using ST3Gal3 and a sialic acid donor.
and a galactose donor. The terminal galactose residues of the N-linked glycosyl units are
"capped" with sialic acid, using ST3Gal3 and a sialic acid donor. The terminal galactose
residues are functionalized with a sialic acid bearing a PEG moiety, using an appropriate
sialic acid donor and ST3Gal1. In Figure 33E, EPO that is expressed in a mammalian cell
system is remodeled by functionalizing the N-linked glycosyl residues with a
PEG-derivatized sialic acid moiety. The peptide is contacted with ST3Gal3 and an appropriately
modified sialic acid donor. In Figure 33F, EPO that is expressed in an insect cell system is
remodeled by adding one or more terminal N-acetylglucosamine residues by contacting the
peptide with a N-acetylglucosamine donor and one or more of GnT I, GnT II, and GnT V.
The peptide is then PEGylated by contacting it with a PEGylated galactose donor and a
galactosyltransferase. In Figure 33G, EPO that is expressed in an insect cell system is
remodeled by the addition of terminal N-acetylglucosamine residues, using an appropriate
N-acetylglucosamine donor and one or more of GnT I, GnT II and GnT V. A galactosidase that
is made to operate in a synthetic, rather than a hydrolytic manner is utilized to add an
activated PEGylated galactose donor to the N-acetylglucosamine residues. In Figure 33H, a
mutant EPO expressed in mammalian cells is remodeled by capping appropriate terminal
residues with a sialic acid donor that is modified with levulinic acid, adding a reactive ketone
to the sialic acid donor. After addition to a glycosyl residue of the peptide, the ketone is
derivatized with a moiety such as a hydrazine- or amine-PEG. Figure 33I sets forth an
exemplary remodeling pathway for a mutant EPO that is expressed in a mammalian cell
system. PEG is added to the glycosyl residue using a PEG-modified sialic acid and an
α2,8-sialyltransferase. Figure 33J sets forth another exemplary remodeling pathway for a mutant
EPO that is expressed in a mammalian cell system. The sialic acid is added to the glycosyl
residue with a sialic acid donor and an α2,8-sialyltransferase.

In another exemplary embodiment, the invention provides methods for modifying
granulocyte-macrophage colony-stimulating factor (GM-CSF), as shown in Figures 34A to
34K. In Figure 34B, GM-CSF expressed in mammalian cells is first treated with sialidase to
trim back the sialic acid residues, and then PEGylated using ST3Gal3 and a donor of
PEG-sialic acid. In Figure 34C, GM-CSF expressed in mammalian cells is first treated with
sialidase to trim back the sialic acid residues, then PEGylated using ST3Gal3 and a donor of
PEG-sialic acid, and then is further sialylated using a sialic acid donor and ST3Gal1 and/or
galactosylated using a galactose donor and a galactosyltransferase. Then, IFNγ is PEGylated
using a donor of PEG-sialic acid and ST3Gal3. In Figure 35E, IFNγ that is expressed in a
mammalian system is first treated with sialidase to trim back terminal sialic acid residues.
The polypeptide is then PEGylated using ST3Gal3 and a donor of PEG-sialic acid, and is
further sialylated with ST3Gal3 and a sialic acid donor. Figure 35F describes another method
for modifying IFNγ expressed in a mammalian system. The protein is modified by capping
appropriate terminal residues with a sialic acid donor that is modified with levulinic acid,
adding a reactive ketone to the sialic acid donor. After addition to a glycosyl residue of the
peptide, the ketone is derivatized with a moiety such as a hydrazine- or amine-PEG. In
Figure 35G, IFNγ expressed in mammalian cells is remodeled by addition of sialic acid using
a sialic acid donor and an α2,8-sialyltransferase. In Figure 35I, IFNγ expressed in insect or
fungal cells is modified by addition of N-acetylglucosamine using an appropriate donor and
one or more of GnT I, II, IV, and V. The protein is further modified by addition of PEG
moieties using a donor of PEGylated galactose and a galactosyltransferase. Figure 35J offers
a method for modifying IFNγ expressed in yeast. The polypeptide is first treated with
endoglycanase to trim back the saccharide chains, and then galactosylated using a galactose
donor and a galactosyltransferase. Then, IFNγ is PEGylated using a donor of PEGylated
sialic acid and ST3Gal3. In Figure 35K, IFNγ produced by mammalian cells is modified as
follows: the polypeptide is first contacted with ST3Gal3 and a donor of sialic acid that is
derivatized with a reactive galactose via a linker, so that the polypeptide is attached to the
reactive galactose via the linker and sialic acid residue. The polypeptide is then contacted
with a galactosyltransferase and transferrin pre-treated with endoglycanase, and thus becomes
connected with transferrin via the galactose residue. In the scheme illustrated by Figure 35L,
IFNγ, which is expressed in a mammalian system, is modified via the action of ST3Gal3:
PEGylated sialic acid is transferred from a suitable donor to IFNγ. Figure 35M is an example
of modifying IFNγ expressed in insect or fungal cells, where PEGylation of the polypeptide
is achieved by transferring PEGylated N-acetylglucosamine from a donor to IFNγ using GnT
I and/or II. In Figure 35N, IFNγ expressed in a mammalian system is remodeled with
addition of PEGylated sialic acid using a suitable donor and an α2,8-sialyltransferase.

In another exemplary embodiment, the invention provides methods for modifying α1
anti-trypsin (α1-protease inhibitor). Some such examples can be found in Figures 36A to
acid donor. After addition to a glycosyl residue of the peptide, the ketone is derivatized with
a moiety such as one or more mannose-6-phosphate groups. In Figure 37G, Cerezyme™
expressed in mammalian cells is sialylated using a sialic acid donor and a
2,8-sialyltransferase. In Figure 37I, Cerezyme™ expressed in insect cells first has
N-acetylglucosamine added using a suitable donor and one or more of GnT I, II, IV, and V, and
then is PEGylated using a galactosyltransferase and a donor of PEG-galactose. In Figure 37J,
Cerezyme™ expressed in yeast is first treated with endoglycanase to trim back the glycosyl
groups, then galactosylated using a galactose donor and a galactosyltransferase, and then
PEGylated using ST3Gal3 and a donor of PEG-sialic acid. In Figure 37K, Cerezyme™
expressed in mammalian cells is first contacted with ST3Gal3 and two reactive sialic acid
residues connected via a linker, so that the polypeptide is attached to one reactive sialic acid
via the linker and the second sialic acid residue. The polypeptide is then contacted with
ST3Gal3 and desialylated transferrin, and thus becomes connected with transferrin. Then,
the polypeptide is sialylated using a sialic acid donor and ST3Gal3.

In another exemplary embodiment, the invention provides methods for modifying
Tissue-Type Plasminogen Activator (TPA) and its mutant. Several specific modification
schemes are presented in Figures 38A to 38W. Figure 38B illustrates one modification
procedure: after TPA is expressed by mammalian cells, it is treated with one or more of
mannosidase(s) and sialidase to trim back mannosyl and/or sialic acid residues. Terminal
N-acetylglucosamine is then added by contacting the polypeptide with a suitable donor of
N-acetylglucosamine and one or more of GnT I, II, IV, and V. TPA is further galactosylated
using a galactose donor and a galactosyltransferase. Then, PEG is attached to the molecule
by way of sialylation catalyzed by ST3Gal3 and using a donor of sialic acid derivatized with
a PEG moiety. In Figure 38C, TPA is expressed in insect or fungal cells. The modification
includes the steps of addition of N-acetylglucosamine using an appropriate donor of
N-acetylglucosamine and GnT I and/or II; galactosylation using a galactose donor and a
galactosyltransferase; and attachment of PEG by way of sialylation using ST3Gal3 and a
donor of sialic acid derivatized with PEG. In Figure 38D, TPA is expressed in yeast and
subsequently treated with endoglycanase to trim back the saccharide chains. The polypeptide
is further PEGylated via the action of a galactosyltransferase, which catalyzes the transfer of
a PEG-galactose from a donor to TPA. In Figure 38E, TPA is expressed in insect or yeast
expressed in bacteria is modified with N-acetylgalactosamine using a proper donor and
N-acetylgalactosamine transferase, followed by a step of PEGylation with a PEG-sialic acid
donor and a sialyltransferase. Figure 39E offers another scheme of modifying IL-2 produced
by a mammalian system. The polypeptide is modified by capping appropriate terminal
residues with a sialic acid donor that is modified with levulinic acid, adding a reactive ketone
to the sialic acid donor. After addition to a glycosyl residue of the peptide, the ketone is
derivatized with a moiety such as a hydrazine- or amine-PEG. Figure 39F illustrates an
example of remodeling IL-2 expressed by E. coli. The polypeptide is PEGylated using a
reactive N-acetylgalactosamine complex derivatized with a PEG group and an enzyme that is
modified so that it functions as a synthetic enzyme rather than a hydrolytic one. In Figure
39G, IL-2 expressed by bacteria is modified by addition of PEGylated N-acetylgalactosamine
using a proper donor and N-acetylgalactosamine transferase.

In another exemplary embodiment, the invention provides methods for modifying
Factor VIII, as shown in Figures 40A to 40N. In Figure 40B, Factor VIII expressed in
mammalian cells is first treated with sialidase to trim back the sialic acid residues, and is then
PEGylated using ST3Gal3 and a donor of PEG-sialic acid. In Figure 40C, Factor VIII
expressed in mammalian cells is first treated with sialidase to trim back the sialic acid
residues, then PEGylated using ST3Gal3 and a proper donor, and is then further sialylated
using ST3Gal1 and a sialic acid donor.

In Figure 40E, mammalian cell produced Factor VIII is modified by the single step of
PEGylation, using ST3Gal3 and a donor of PEGylated sialic acid. Figure 40F offers another
example of modification of Factor VIII that is expressed by mammalian cells. The protein is
PEGylated using ST3Gal1 and a donor of PEGylated sialic acid. In Figure 40G, mammalian
cell expressed Factor VIII is remodeled following another scheme: it is PEGylated using a
2,8-sialyltransferase and a donor of PEG-sialic acid. In Figure 40I, Factor VIII produced by
mammalian cells is modified by capping appropriate terminal residues with a sialic acid
donor that is modified with levulinic acid, adding a reactive ketone to the sialic acid donor.
After addition to a glycosyl residue of the peptide, the ketone is derivatized with a moiety
such as a hydrazine- or amine-PEG. In Figure 40J, Factor VIII expressed by mammalian
cells is first treated with Endo-H to trim back glycosyl groups. It is then PEGylated using a
galactosyltransferase and a donor of PEG-galactose. In Figure 40K, Factor VIII expressed in
[...] sialylated using a sialic acid donor and a 2,8-sialyltransferase. In Figure 41I, urokinase
expressed in insect cells is modified in the following steps: first, N-acetylglucosamine is
added to the polypeptide using a suitable donor of N-acetylglucosamine and one or more of
GnT I, II, IV, and V; then PEGylated galactose is added, using a galactosyltransferase and a
donor of PEG-galactose. In Figure 41J, urokinase expressed in yeast is first treated with
endoglycanase to trim back glycosyl groups, then galactosylated using a galactose donor and
a galactosyltransferase, and then PEGylated using ST3Gal3 and a donor of PEG-sialic acid.
In Figure 41K, urokinase expressed in mammalian cells is first contacted with ST3Gal3 and
two reactive sialic acid residues that are connected via a linker, so that the polypeptide is
attached to one reactive sialic acid via the linker and second sialic acid residue. The
polypeptide is then contacted with ST3Gal1 and desialylated urokinase produced in
mammalian cells, and thus becomes connected with a second molecule of urokinase. Then,
the whole molecule is further sialylated using a sialic acid donor and ST3Gal1 and/or ST3Gal3.
In Figure 41L, isolated urokinase is first treated with sulfohydrolase to remove sulfate
groups, and is then PEGylated using a sialyltransferase and a donor of PEG-sialic acid. In
Figure 41M, isolated urokinase is first treated with sulfohydrolase and hexosaminidase to
remove sulfate groups and hexosamine groups, and then PEGylated using a
galactosyltransferase and a donor of PEG-galactose.

In another exemplary embodiment, the invention provides methods for modifying
DNase I, as shown in Figures 42A to 42K. In Figure 42B, DNase I is expressed in a
mammalian system and modified in the following steps: first, the protein is treated with
sialidase to trim back the sialic acid residues; then the protein is PEGylated with ST3Gal3
using a donor of PEG-sialic acid. In Figure 42C, DNase I expressed in mammalian cells is
first treated with sialidase to trim back the sialic acid residues, then PEGylated with ST3Gal3
using a PEG-sialic acid donor, and is then sialylated using ST3Gal3 and a sialic acid donor.
In Figure 42D, DNase I expressed in a mammalian system is first exposed to sialidase and
galactosidase to trim back the glycosyl groups, then galactosylated using a galactose donor
and an α-galactosyltransferase, and then PEGylated using ST3Gal3 or sialyltransferase and a
donor of PEG-sialic acid. In Figure 42E, DNase I expressed in mammalian cells is first
treated with sialidase to trim back the sialic acid residues, then PEGylated using ST3Gal3 and
a PEG-sialic acid donor, and then sialylated with ST3Gal3 using a sialic acid donor. In
[...] human growth hormone-mucin fusion protein: the fusion protein is PEGylated through the
action of N-acetylgalactosamine transferase using a donor of PEGylated
N-acetylgalactosamine. Figure 45J provides a further remodeling scheme for human growth
hormone-mucin fusion protein. The fusion protein is first contacted with
N-acetylgalactosamine transferase and a donor of N-acetylgalactosamine that is derivatized with
a reactive sialic acid via a linker, so that the fusion protein is attached to the reactive sialic
acid via the linker and N-acetylgalactosamine. The fusion protein is then contacted with a
sialyltransferase and asialo-transferrin, and thus becomes connected with transferrin via the
sialic acid residue. Then, the fusion protein is capped with sialic acid residues using
ST3Gal3 and a sialic acid donor. In Figure 45K, yet another scheme is given for
modification of human growth hormone(N) produced in bacteria. The polypeptide is first
contacted with NHS-CO-linker-SA-CMP and becomes coupled with the reactive sialic acid
through the linker. The polypeptide is then contacted with ST3Gal3 and asialo-transferrin
and becomes linked to transferrin via the sialic acid residue. Then, the polypeptide is
sialylated using ST3Gal3 and a sialic acid donor.

In another exemplary embodiment, the invention provides methods for remodeling
TNF receptor IgG fusion protein (TNFR-IgG, or Enbrel™), as shown in Figures 46A to 46G.
Figure 46B illustrates a modification procedure in which TNFR-IgG expressed in a
mammalian system is first sialylated with a sialic acid donor and a sialyltransferase,
ST3Gal1; the fusion protein is then galactosylated with a galactose donor and a
galactosyltransferase; then, the fusion protein is PEGylated via the action of ST3Gal3 and a
donor of sialic acid derivatized with PEG. In Figure 46C, TNFR-IgG expressed in
mammalian cells is initially treated with sialidase to trim back sialic acid residues. PEG
moieties are subsequently attached to TNFR-IgG by way of transferring PEGylated sialic
acid from a donor to the fusion protein in a reaction catalyzed by ST3Gal1. In Figure 46D,
TNFR-IgG is expressed in a mammalian system and modified by addition of PEG through
the galactosylation process, which is mediated by a galactosyltransferase using a
PEG-galactose donor. In Figure 46E, TNFR-IgG is expressed in a mammalian system. The first
step in remodeling of the fusion protein is adding O-linked sialic acid residues using a sialic
acid donor and a sialyltransferase, ST3Gal1. Subsequently, PEGylated galactose is added to
the fusion protein using a galactosyltransferase and a suitable donor of galactose with PEG
[...] PEGylated galactose using a suitable donor and a galactosyltransferase. In Figure 49D,
Remicade™ expressed in fungi is first treated with Endo-H to trim back the glycosyl chains,
then galactosylated using a galactose donor and a galactosyltransferase, and then conjugated
to a radioisotope using ST3Gal3 and a reactive sialic acid derivatized with the radioisotope.
In another exemplary embodiment, the invention provides methods for modifying
Reopro, which is mutated to contain an N-glycosylation site. Figures 50A to 50L contain such
examples. In Figure 50B, Reopro expressed in a mammalian system is first treated with
sialidase to trim back the sialic acid residues, and then PEGylated using ST3Gal3 and a donor
of PEG-sialic acid. In Figure 50C, Reopro expressed in insect cells is modified by addition
of PEGylated N-acetylglucosamine using an appropriate donor and GnT I and/or II. In
Figure 50D, Reopro expressed in yeast is first treated with Endo-H to trim back the glycosyl
groups. Subsequently, the protein is PEGylated using a galactosyltransferase and a donor of
PEG-galactose. In Figure 50F, Reopro expressed in mammalian cells is first treated with
sialidase to trim back the sialic acid residues and then PEGylated with ST3Gal1 using a
donor of PEGylated sialic acid. In Figure 50G, Reopro expressed in insect cells is modified
by PEGylation using a galactosyltransferase and a donor of PEG-galactose. In Figure 50H,
Reopro expressed in bacteria first has N-acetylgalactosamine added using
N-acetylgalactosamine transferase and a suitable donor. The protein is then PEGylated using a
sialyltransferase and a donor of PEG-sialic acid. In Figure 50J, Reopro expressed in bacteria
is modified in a different scheme: it is PEGylated via the action of N-acetylgalactosamine
transferase, using a donor of PEGylated N-acetylgalactosamine. In Figure 50K, bacterially
expressed Reopro is modified in yet another method: first, the polypeptide is contacted with
N-acetylgalactosamine transferase and a donor of N-acetylgalactosamine that is derivatized
with a reactive sialic acid via a linker, so that the polypeptide is attached to the reactive sialic
acid via the linker and N-acetylgalactosamine. The polypeptide is then contacted with
ST3Gal3 and asialo-transferrin and thus becomes connected with transferrin via the sialic
acid residue. Then, the polypeptide is capped with sialic acid residues using a proper donor
and ST3Gal3. Figure 50L offers an additional scheme of modifying bacterially expressed
Reopro. The polypeptide is first exposed to NHS-CO-linker-SA-CMP and becomes
connected with the reactive sialic acid through the linker. The polypeptide is then contacted
with ST3Gal3 and asialo-transferrin and thus becomes connected with transferrin via the
[...] invention further encompasses peptides that have N-linked glycan chains at other recognition
sites where the N-linked chains are added using natural or recombinant glycosyltransferases.

Since the recognition site for N-linked glycosylation of a peptide is known, it is
within the skill of persons in the art to create mutated primary peptide sequences wherein a
native N-linked glycosylation recognition site is removed, or alternatively or in addition, one
or more additional N-glycosylation recognition sites are created. Most simply, an asparagine
residue can be removed from the primary sequence of the peptide, thereby removing the
attachment site for a glycan and thus removing one glycan from the mature peptide. For
example, a native recognition site with the sequence asparagine-serine-serine can be
genetically engineered to have the sequence leucine-serine-serine, thus eliminating an
N-linked glycosylation site at this position.

Further, an N-linked glycosylation site can be removed by altering the residues in the
recognition site so that even though the asparagine residue is present, one or more of the
additional recognition residues are absent. For example, a native sequence of
asparagine-serine-serine can be mutated to asparagine-serine-lysine, thus eliminating an N-glycosylation
site at that position. In the case of N-linked glycosylation sites comprising residues other
than the typical recognition sites described above, the skilled artisan can determine the
sequence and residues required for recognition by the appropriate glycosyltransferase, and
then mutate at least one residue so the appropriate glycosyltransferase no longer recognizes
that site. In other words, it is well within the skill of the artisan to manipulate the primary
sequence of a peptide such that glycosylation sites are either created or are removed, or both,
thereby generating a peptide having an altered glycosylation pattern. The invention should
therefore not be construed to be limited to any primary peptide sequence provided herein as
the sole sequence for glycan remodeling, but rather should be construed to include any and all
peptide sequences suitable for glycan remodeling.

To create a mutant peptide, the nucleic acid sequence encoding the primary sequence 
of the peptide is altered so that native codons encoding native amino acid residues are 
mutated to generate a codon encoding another amino acid residue. Techniques for altering 
nucleic acid sequence are common in the art and are described, for example, in any
well-known molecular biology manual.
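
As an illustration of altering a native codon so that it encodes another amino acid, the sketch below uses the standard genetic code. It is a minimal example under stated assumptions, not a laboratory protocol: the helper names `translate` and `mutate_residue`, the nine-base test sequence, and the rule of choosing the replacement codon with the fewest base changes are all illustrative choices.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a coding sequence whose length is a multiple of three."""
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

def mutate_residue(dna, position, new_aa):
    """Replace the codon at a 1-based residue position with a codon for new_aa.

    Among the synonymous codons for new_aa, the one needing the fewest base
    changes from the native codon is chosen (an assumption of this sketch).
    """
    start = 3 * (position - 1)
    native = dna[start:start + 3]
    if len(native) != 3:
        raise ValueError("position is outside the coding sequence")
    candidates = [c for c, aa in CODON_TABLE.items() if aa == new_aa]
    best = min(candidates, key=lambda c: sum(a != b for a, b in zip(c, native)))
    return dna[:start] + best + dna[start + 3:]

native_dna = "AATTCTTCT"                      # encodes Asn-Ser-Ser
mutant_dna = mutate_residue(native_dna, 1, "L")
print(translate(native_dna), "->", translate(mutant_dna))
```

In practice the codon would also be chosen with the codon preferences of the expression system in mind.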
[...] selected which include recombinant vectors bearing the mutated sequence arrangement. A
genetic selection scheme was devised by Kunkel et al. (1987, Methods
Enzymol. 154:367-382) to enrich for clones incorporating the mutagenic oligonucleotide.
Alternatively, PCR™ with commercially available thermostable enzymes such as
Taq polymerase may be used to incorporate a mutagenic oligonucleotide primer into an
amplified DNA fragment that can then be cloned into an appropriate cloning or expression
vector. The PCR™-mediated mutagenesis procedures of Tomic et al. (1990, Nucl. Acids
Res., 12:1656) and Upender et al. (1995, Biotechniques, 18:29-31) provide two examples of
such protocols. A PCR™ employing a thermostable ligase in addition to a thermostable
polymerase may also be used to incorporate a phosphorylated mutagenic oligonucleotide into
an amplified DNA fragment that may then be cloned into an appropriate cloning or
expression vector. The mutagenesis procedure described by Michael (1994, Biotechniques
16:410-412) provides an example of one such protocol.

Not all Asn-X-Ser/Thr sequences are N-glycosylated, suggesting that the context in which
the motif is presented is important. In another approach, libraries of mutant peptides having
novel N-linked consensus sites are created in order to identify novel N-linked sites that are
glycosylated in vivo and are beneficial to the activity, stability or other characteristics of the
peptide.

As noted previously, the consensus sequence for the addition of N-linked glycan
chains in glycoproteins is Asn-X-Ser/Thr where X can be any amino acid. The nucleotide
sequence encoding the amino acid two positions to the carboxyl terminal side of the Asn may
be mutated to encode a Ser and/or Thr residue using standard procedures known to those of
ordinary skill in the art. As stated above, not all Asn-X-Ser/Thr sites are modified by the
addition of glycans. Therefore, each recombinant mutated glycoprotein must be expressed in
a fungal, yeast or animal or mammalian expression system and analyzed for the addition of
an N-linked glycan chain. The techniques for the characterization of glycosylation sites are
well known to one skilled in the art. Further, the biological function of the mutated
recombinant glycoprotein can be determined using assays standard for the particular protein
being examined. Thus, it becomes a simple matter to manipulate the primary sequence of a
peptide and identify novel glycosylation sites contained therein, and further determine the
effect of the novel site on the biological activity of the peptide.
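
The site-creation step just described (mutating the residue two positions to the carboxyl terminal side of an existing Asn to Ser or Thr) can be enumerated for a given sequence. The sketch below is a hypothetical illustration in Python: the function name `propose_new_sites`, the mutation labels and the example peptide are assumptions made here, not part of the specification.

```python
def propose_new_sites(protein):
    """List single substitutions that would create Asn-X-Ser/Thr at an existing Asn.

    Returns (label, mutant_sequence) pairs, where the label uses the
    conventional <old residue><1-based position><new residue> form.
    """
    proposals = []
    for i, residue in enumerate(protein):
        j = i + 2  # residue two positions C-terminal of the Asn
        if residue != "N" or j >= len(protein) or protein[j] in "ST":
            continue  # not an Asn, no room, or already a candidate site
        for new in "ST":
            mutant = protein[:j] + new + protein[j + 1:]
            proposals.append((f"{protein[j]}{j + 1}{new}", mutant))
    return proposals

# Hypothetical peptide: Asn2 lacks Ser/Thr at position 4; Asn6 already has Thr8.
for label, mutant in propose_new_sites("MNAKGNLT"):
    print(label, mutant)
```

Each proposed mutant is only a candidate: as the text notes, it must still be expressed and analyzed for the presence of the glycan and for retained biological function.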
In yet another approach, advantageous sites for new O-linked sites may be found in a
peptide by creating libraries of the peptide containing various new O-linked sites. For
example, the consensus amino acid sequence for N-acetylgalactosamine addition by an
N-acetylgalactosaminyltransferase depends on the specific transferase used. The amino acid
sequence of a peptide may be scanned to identify contiguous groups of amino acids that can
be mutated to generate potential sites for addition of O-linked glycan chains. These
mutations can be generated using standard procedures known to those of ordinary skill in the
art as described previously. In order to determine if any discovered glycosylation site is
actually glycosylated, each recombinant mutated peptide is then expressed in a suitable
expression system and is subsequently analyzed for the addition of the site and/or the
presence of an O-linked glycan chain.

C. Chemical synthesis of peptides

While the primary structure of peptides useful in the invention can be generated most
efficiently in a cell-based expression system, it is within the scope of the present invention
that the peptides may be generated synthetically. Chemical synthesis of peptides is well
known in the art and includes, without limitation, stepwise solid phase synthesis, and fragment
condensation either in solution or on solid phase. A classic stepwise solid phase synthesis
involves covalently linking an amino acid corresponding to the carboxy-terminal amino acid
of the desired peptide chain to a solid support and extending the peptide chain toward the
amino end by stepwise coupling of activated amino acid derivatives having activated
carboxyl groups. After completion of the assembly of the fully protected solid phase bound
peptide chain, the peptide-solid phase covalent attachment is cleaved by suitable chemistry
and the protecting groups are removed to yield the product peptide. See, R. Merrifield, Solid
Phase Peptide Synthesis: The Synthesis of a Tetrapeptide, J. Am. Chem. Soc., 85:2149-2154
(1963). The longer the peptide chain, the more challenging it is to obtain high-purity,
well-defined products. Due to the production of complex mixtures, the stepwise solid phase
synthesis approach has size limitations. In general, well-defined peptides of 100 contiguous
amino acid residues or more are not routinely prepared via stepwise solid phase synthesis.
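
The size limitation can be made concrete with a simple calculation. The sketch below assumes, purely for illustration, that every coupling cycle succeeds independently with the same efficiency, so that the fraction of full-length product is the efficiency raised to the number of couplings. The 99% figure and the function name `full_length_fraction` are hypothetical and do not come from the specification.

```python
def full_length_fraction(n_residues, coupling_efficiency):
    """Fraction of chains that are full length after stepwise assembly.

    Assumes n_residues - 1 independent coupling cycles, each succeeding with
    the same probability (a simplifying assumption for this sketch).
    """
    if n_residues < 1:
        raise ValueError("a peptide needs at least one residue")
    if not 0.0 <= coupling_efficiency <= 1.0:
        raise ValueError("coupling efficiency must lie between 0 and 1")
    return coupling_efficiency ** (n_residues - 1)

# Hypothetical 99% per-cycle efficiency at several chain lengths.
for length in (10, 30, 100):
    print(length, round(full_length_fraction(length, 0.99), 3))
```

Under this assumption the full-length fraction falls geometrically with chain length, and the remainder is a mixture of truncated and deletion products, which is one way to read the statement above that complex mixtures limit the size of peptides made by the stepwise approach.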
The segment condensation method involves preparation of several peptide segments
by the solid phase stepwise method, followed by cleavage from the solid phase and
purification of these maximally protected segments. The protected segments are condensed
one-by-one to the first segment, which is bound to the solid phase.

The peptides useful in the present invention may be synthesized by exclusive solid
phase synthesis, partial solid phase methods, fragment condensation or classical solution
synthesis. These synthesis methods are well-known to those of skill in the art (see, for
example, Merrifield, J. Am. Chem. Soc. 85:2149 (1963), Stewart et al., "Solid Phase Peptide
Synthesis" (2nd Edition), (Pierce Chemical Co. 1984), Bayer and Rapp, Chem. Pept. Prot. 3:3
(1986), Atherton et al., Solid Phase Peptide Synthesis: A Practical Approach (IRL Press
1989), Fields and Colowick, "Solid-Phase Peptide Synthesis," Methods in Enzymology
Volume 289 (Academic Press 1997), and Lloyd-Williams et al., Chemical Approaches to the
Synthesis of Peptides and Proteins (CRC Press, Inc. 1997)). Variations in total chemical
synthesis strategies, such as "native chemical ligation" and "expressed peptide ligation" are
also standard (see, for example, Dawson et al., Science 266:776 (1994), Hackeng et al., Proc.
Nat'l Acad. Sci. USA 94:7845 (1997), Dawson, Methods Enzymol. 287:34 (1997), Muir et
al., Proc. Nat'l Acad. Sci. USA 95:6705 (1998), and Severinov and Muir, J. Biol. Chem.
273:16205 (1998)). Also useful are the solid phase peptide synthesis methods developed by
Gryphon Sciences, South San Francisco, CA. See, U.S. Patent Nos. 6,326,468, 6,217,873,
6,174,530, and 6,001,364, all of which are incorporated in their entirety by reference herein.

D. Post-translational modifications

It will be appreciated by one of ordinary skill in the art that peptides may undergo
post-translational modification besides the addition of N-linked and/or O-linked glycans
thereto. It is contemplated that peptides having post-translational modifications other than
glycosylation can be used as peptides in the invention, as long as the desired biological
activity or function of the peptide is maintained or improved. Such post-translational
modifications may be natural modifications usually carried out in vivo, or engineered
modifications of the peptide carried out in vitro. Contemplated known modifications include,
but are not limited to, acetylation, acylation, ADP-ribosylation, amidation, covalent
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent
attachment of phosphatidylinositol, cross-linking, cyclisation, disulfide bond formation.
[...] maleimides. Derivatizing agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate
yield photoactivatable intermediates that are capable of forming crosslinks in the presence of
light. Alternatively, reactive water-insoluble matrices such as cyanogen bromide activated
carbohydrates and the reactive substrates described in U.S. Pat. Nos. 3,969,287 and 3,691,016
may be employed for peptide immobilization.

E. Fusion peptides

Peptides useful in the present invention may comprise fusion peptides. Fusion
peptides are particularly advantageous where biological and/or functional characteristics of
two peptides are desired to be combined in one peptide molecule. Such fusion peptides can
present combinations of biological activity and function that are not found in nature to create
novel and useful molecules of therapeutic and industrial applications. Biological activities of
interest include, but are not limited to, enzymatic activity, receptor and/or ligand activity,
immunogenic motifs, and structural domains.

Such fusion peptides are well known in the art, and the methods of creation will be
well-known to those in the art. For example, a human α-interferon-human albumin fusion peptide
has been made wherein the resulting peptide has the therapeutic benefits of α-interferon
combined with the long circulating life of albumin, thereby creating a therapeutic
composition that allows reduced dosing frequency and potentially reduced side effects in
patients. See, Albuferon™ from Human Genome Sciences, Inc. and U.S. Patent No.
5,766,883. Other fusion peptides include antibody molecules that are described elsewhere
herein.

F. Generation of smaller "biologically active" molecules

The peptides used in the invention may be variants of native peptides, wherein a
fragment of the native peptide is used in place of the full length native peptide. In addition,
pre-pro- and pre-peptides are contemplated. Variant peptides may be smaller in size than the
native peptide, and may comprise one or more domains of a larger peptide. Selection of
specific peptide domains can be advantageous when the biological activity of certain domains
in the peptide is desired, but the biological activity of other domains in the peptide is not
desired. Also included are truncations of the peptide and internal deletions which may
enhance the desired therapeutic effect of the peptide. Any such form of a peptide is
contemplated to be useful in the present invention provided that the desired biological
activity of the peptide is preserved.

Shorter versions of peptides may have unique advantages not found in the native
peptide. In the case of human albumin, it has been found that a truncated form comprising as
little as 63% of the native albumin peptide is advantageous as a plasma volume expander.
The truncated albumin peptide is considered to be better than the native peptide for this
therapeutic purpose because an individual peptide dose of only one-half to two-thirds that of
natural human serum albumin, or recombinant human serum albumin, is required for the
equivalent colloid osmotic effect. See U.S. Patent No. 5,380,712, the entirety of which is
incorporated by reference herein.

Smaller "biologically active" peptides have also been found to have enhanced
therapeutic activity as compared to the native peptide. The therapeutic potential of IL-2 is
limited by various side effects dominated by the vascular leak syndrome. A shorter
chemically synthesized version of the peptide consisting of residues 1-30 corresponding to
the entire α-helix was found to fold properly and contain the natural IL-2 biological activity
without the attending side effects.

G. Generation of novel peptides 

The peptide of the invention may be derived from a primary sequence of a
native peptide, or may be engineered using any of the many means known to those of skill in
the art. Such engineered peptides can be designed and/or selected because of enhanced or
novel properties as compared with the native peptide. For example, peptides may be
engineered to have increased enzyme reaction rates, increased or decreased binding affinity
to a substrate or ligand, increased or decreased binding affinity to a receptor, altered
specificity for a substrate, ligand, receptor or other binding partner, increased or decreased
stability in vitro and/or in vivo, or increased or decreased immunogenicity in an animal.

H. Mutations 

1. Rational design mutation
The peptides useful in the methods of the invention may be mutated to enhance a
desired biological activity or function, to diminish an undesirable property of the peptide,
and/or to add novel activities or functions to the peptide. "Rational peptide design" may be
used to generate such altered peptides. Once the amino acid sequence and structure of the
peptide is known and a desired mutation planned, the mutations can be made most
conveniently to the corresponding nucleic acid codon which encodes the amino acid residue
that is desired to be mutated. One of skill in the art can easily determine how the nucleic
acid sequence should be altered based on the universal genetic code, and knowledge of codon
preferences in the expression system of choice. A mutation in a codon may be made to
change the amino acid residue that will be polymerized into the peptide during translation.
Alternatively, a codon may be mutated so that the corresponding encoded amino acid residue
is the same, but the codon choice is better suited to the desired peptide expression system.
For example, cys-residues may be replaced with other amino acids to remove disulfide bonds
from the mature peptide, catalytic domains may be mutated to alter biological activity, and in
general, isoforms of the peptide can be engineered. Such mutations can be point mutations,
deletions, insertions and truncations, among others.
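
The second kind of codon change mentioned above, where the encoded residue stays the same but the codon is better suited to the expression system, can be sketched as follows. This is a minimal illustration using the standard genetic code. The preference table is entirely hypothetical (real codon preferences depend on the chosen host), and the names `recode` and `HYPOTHETICAL_PREFERENCES` are introduced here for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a coding sequence whose length is a multiple of three."""
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

def recode(dna, preferred):
    """Swap each codon for a preferred synonymous codon, when one is listed."""
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    out = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        choice = preferred.get(residue, codon)
        if CODON_TABLE[choice] != residue:
            raise ValueError(f"{choice} does not encode {residue}")
        out.append(choice)
    return "".join(out)

# Hypothetical host preferences: one favored codon each for Ser and Leu.
HYPOTHETICAL_PREFERENCES = {"S": "TCT", "L": "CTG"}

native = "AGTAATCTA"  # encodes Ser-Asn-Leu
recoded = recode(native, HYPOTHETICAL_PREFERENCES)
print(native, "->", recoded, translate(recoded))
```

The check inside `recode` guards against a preference table that would silently change the encoded peptide.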

Techniques to mutate specific amino acids in a peptide are well known in the art. The
technique of site-directed mutagenesis, discussed above, is well suited for the directed
mutation of codons. The oligonucleotide-mediated mutagenesis method is also discussed in
detail in Sambrook et al. (2001, Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory, New York, starting at page 15.51). Systematic deletions, insertions and
truncations can be made using linker insertion mutagenesis, digestion with nuclease Bal31,
and linker-scanning mutagenesis, among other methods well known to those in the art
(Sambrook et al., 2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory, New York).

Rational peptide design has been successfully used to increase the stability of
enzymes with respect to thermoinactivation and oxidation. For example, the stability of an
enzyme was improved by removal of asparagine residues in α-amylase (Declerck et al., 2000,
J. Mol. Biol. 301:1041-1057), the introduction of more rigid structural elements such as
proline into α-amylase (Igarashi et al., 1999, Biosci. Biotechnol. Biochem. 63:1535-1540)
and D-xylose isomerase (Zhu et al., 1999, Protein Eng. 12:635-638). Further, the
introduction of additional hydrophobic contacts stabilized 3-isopropylmalate dehydrogenase
(Akanuma et al., 1999, Eur. J. Biochem. 260:499-504) and formate dehydrogenase obtained
from Pseudomonas sp. (Rojkova et al., 1999, FEBS Lett. 445:183-188). The mechanisms
[...] can be carried out in an automatic DNA synthesizer, and the synthetic genes may then be
ligated into an appropriate expression vector. The synthesis of degenerate oligonucleotides is
known in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981)
Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton,
Amsterdam: Elsevier pp. 273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura
et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477). Such techniques
have been employed in the directed evolution of other peptides (see, for example, Scott et al.
(1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990)
Science 249:404-406; Cwirla et al. (1990) PNAS 87:6378-6382; as well as U.S. Pat. Nos.
5,223,409, 5,198,346, and 5,096,815).

a. Directed evolution. 
Peptides useful in the methods of the invention may also be generated using "directed
evolution" techniques. In contrast to site directed mutagenesis techniques where knowledge
of the structure of the peptide is required, there now exist strategies to generate libraries of
mutations from which to obtain peptides with improved properties without knowledge of the
structural features of the peptide. These strategies are generally known as "directed
evolution" technologies and are different from traditional random mutagenesis procedures in
that they involve subjecting the nucleic acid sequence encoding the peptide of interest to
recursive rounds of mutation, screening and amplification.

In some "directed evolution" techniques, the diversity in the nucleic acids obtained is
generated by mutation methods that randomly create point mutations in the nucleic acid
sequence. The point mutation techniques include, but are not limited to, "error-prone
PCR™" (Caldwell and Joyce, 1994, PCR Methods Appl. 2:28-33; and Ke and Madison,
1997, Nucleic Acids Res. 25:3371-3372), repeated oligonucleotide-directed mutagenesis
(Reidhaar-Olson et al., 1991, Methods Enzymol. 208:564-586), and any of the
aforementioned methods of random mutagenesis.
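
The idea of recursive rounds of random point mutation, screening and amplification can be illustrated with a toy simulation. The sketch below is not a model of any particular laboratory method: the per-base mutation rate, the library size, and the scoring function (GC content, standing in for whatever property a real screen would measure) are hypothetical placeholders, as are the function names.

```python
import random

BASES = "ACGT"

def point_mutate(dna, rate, rng):
    """Replace each base, with probability `rate`, by a different random base."""
    out = []
    for base in dna:
        if rng.random() < rate:
            out.append(rng.choice([b for b in BASES if b != base]))
        else:
            out.append(base)
    return "".join(out)

def evolve(parent, score, rounds, library_size, rate, rng):
    """Run recursive rounds of mutation and screening, keeping the best variant."""
    best = parent
    for _ in range(rounds):
        library = [point_mutate(best, rate, rng) for _ in range(library_size)]
        library.append(best)             # retain the parent so the score never drops
        best = max(library, key=score)   # "screening": keep the top scorer
    return best

def gc_content(dna):
    """Toy screen: count of G and C bases (a stand-in for a real assay)."""
    return dna.count("G") + dna.count("C")

rng = random.Random(0)                   # fixed seed so runs are repeatable
parent = "ATATATATATATATATATAT"
evolved = evolve(parent, gc_content, rounds=5, library_size=20, rate=0.05, rng=rng)
print(gc_content(parent), "->", gc_content(evolved))
```

Keeping the current best sequence in each library is a design choice of this sketch; it guarantees the score is non-decreasing from round to round.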

Another method of creating diversity upon which directed evolution can act is the use
of mutator genes. The nucleic acid of interest is cultured in a mutator cell strain the genome
of which typically encodes defective DNA repair genes (U.S. Patent No. 6,365,410;
Selifonova et al., 2001, Appl. Environ. Microbiol. 67:3645-3649; Long-McGie et al., 2000,
Biotech. Bioeng. 68:121-125; see, Genencor International Inc, Palo Alto CA).
Achieving diversity using directed evolution techniques may also be accomplished
using saturation mutagenesis along with degenerate primers (Gene Site Saturation
Mutagenesis™, Diversa Corp., San Diego, CA). In this type of saturation mutagenesis,
degenerate primers designed to cover the length of the nucleic acid sequence to be diversified
are used to prime the polymerase in PCR reactions. In this manner, each codon of a coding
sequence for an amino acid may be mutated to encode each of the remaining common
nineteen amino acids. This technique may also be used to introduce mutations, deletions and
insertions to specific regions of a nucleic acid coding sequence while leaving the rest of the
nucleic acid molecule untouched. Procedures for the gene saturation technique are well
known in the art, and can be found in U.S. Patent 6,171,820.
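
The outcome of mutating each codon to encode each of the remaining nineteen common amino acids can be enumerated in silico. The sketch below only lists the resulting coding sequences; it does not model degenerate primers or the PCR steps themselves. It uses the standard genetic code, picks one representative codon per amino acid (an arbitrary choice made here), and the name `saturation_library` and the test sequence are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One representative codon per amino acid (first in table order; stops skipped).
REPRESENTATIVE = {}
for codon, residue in CODON_TABLE.items():
    if residue != "*":
        REPRESENTATIVE.setdefault(residue, codon)

def translate(dna):
    """Translate a coding sequence whose length is a multiple of three."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

def saturation_library(dna):
    """Map each 1-based codon position to the 19 single-residue variants."""
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    library = {}
    for start in range(0, len(dna), 3):
        native_residue = CODON_TABLE[dna[start:start + 3]]
        library[start // 3 + 1] = [
            dna[:start] + codon + dna[start + 3:]
            for residue, codon in REPRESENTATIVE.items()
            if residue != native_residue
        ]
    return library

library = saturation_library("ATGAATTCT")  # encodes Met-Asn-Ser
print({position: len(variants) for position, variants in library.items()})
```

For a coding sequence of n codons this yields 19 x n single-site variants, which is the scale of library the paragraph above describes.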
b. DNA shuffling 

Novel peptides useful in the methods of the invention may also be generated using the
techniques of gene-shuffling, motif-shuffling, exon-shuffling, and/or codon-shuffling
(collectively referred to as "DNA shuffling"). DNA shuffling techniques may be
employed to modulate the activities of peptides useful in the invention and may be used to
generate peptides having altered activity. See, generally, U.S. Pat. Nos. 5,605,793;
5,811,238; 5,830,721; 5,834,252; and 5,837,458, and Stemmer et al. (1994, Nature
370(6488):389-391); Crameri et al. (1998, Nature 391(6664):288-291); Zhang et al. (1997,
Proc. Natl. Acad. Sci. USA 94(9):4504-4509); Stemmer et al. (1994, Proc. Natl. Acad. Sci.
USA 91(22):10747-10751), Patten et al. (1997, Curr. Opinion Biotechnol. 8:724-33);
Harayama, (1998, Trends Biotechnol. 16(2):76-82); Hansson, et al., (1999, J. Mol. Biol.
287:265-76); and Lorenzo and Blasco (1998, Biotechniques 24(2):308-13) (each of these
patents is hereby incorporated by reference in its entirety).

DNA shuffling involves the assembly of two or more DNA segments by homologous
or site-specific recombination to generate variation in the polynucleotide sequence. DNA
shuffling has been used to generate novel variations of human immunodeficiency virus type 1
proteins (Pekrun et al., 2002, J. Virol. 76(6):2924-35), triazine hydrolases (Raillard et al.,
2001, Chem. Biol. 8(9):891-898), murine leukemia virus (MLV) proteins (Powell et al., 2000,
Nat. Biotechnol. 18(12):1279-1282), and indoleglycerol phosphate synthase (Merz et al., 2000,
Biochemistry 39(5):880-889).

combined with in vivo homologous recombination in yeast (Abecassis et al., 2000, Nucleic
Acids Res. 28:E88). To maximize intergenic recombination, single stranded DNA from
complementary strands of each of the nucleic acids is digested with DNase and annealed
(Kikuchi et al., 2000, Gene 243:133-137). The blunt ends of two truncated nucleic acids of
variable lengths that are linked by a cleavable sequence are then ligated to generate gene
fusion without homologous recombination (Sieber et al., 2001, Nat. Biotechnol. 19:456-460;
Lutz et al., 2001, Nucleic Acids Res. 29:E16; Ostermeier et al., 1999, Nat. Biotechnol.
17:1205-1209; Lutz and Benkovic, 2000, Curr. Opin. Biotechnol. 11:319-324).
Recombination between nucleic acids with little sequence homology in common has also
been enhanced using exonuclease-mediated blunt-ending of DNA fragments and ligating the
fragments together to recombine them (U.S. Patent No. 6,361,974, incorporated herein by
reference in its entirety). The invention contemplates the use of each and every variation
described above as a means of enhancing the biological properties of any of the peptides
and/or enzymes useful in the methods of the invention.

In addition to published protocols detailing directed evolution and gene shuffling
techniques, commercial services are now available that will undertake the gene shuffling and
selection procedures on peptides of choice. Maxygen (Redwood City, CA) offers
commercial services to generate custom DNA shuffled libraries. In addition, this company
will perform customized directed evolution procedures including gene shuffling and selection
on a peptide family of choice.

Optigenix, Inc. (Newark, DE) offers the related service of plasmid shuffling.
Optigenix uses families of genes to obtain mutants therein having new properties. The
nucleic acid of interest is cloned into a plasmid in an Aspergillus expression system. The
DNA of the related family is then introduced into the expression system and recombination in
conserved regions of the family occurs in the host. Resulting mutant DNAs are then
expressed and the peptides produced therefrom are screened for the presence of desired
properties and the absence of undesired properties.

c. Screening procedures
Following each recursive round of "evolution," the desired peptides expressed by
mutated genes are screened for characteristics of interest. The "candidate" genes are then
amplified and pooled for the next round of DNA shuffling. The screening procedure used is
highly dependent on the peptide that is being "evolved" and the characteristic of interest.
Characteristics such as peptide stability, biological activity and antigenicity, among others, can be
selected using procedures that are well known in the art. Individual assays for the biological
activity of preferred peptides useful in the methods of the invention are described elsewhere
herein.

d. Combinations of techniques
It will be appreciated by the skilled artisan that the above techniques of mutation and
selection can be combined with each other and with additional procedures to generate the best
possible peptide molecule useful in the methods of the invention. Thus, the invention is not
limited to any one method for the generation of peptides, and should be construed to
encompass any and all of the methodology described herein. For example, a procedure for
introducing point mutations into a nucleic acid sequence may be performed initially, followed
by recursive rounds of DNA shuffling, selection and amplification. The initial introduction
of point mutations may be used to introduce diversity into a gene population where it is
lacking, and the following round of DNA shuffling and screening will select and recombine
advantageous point mutations.
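
The combined workflow in the example above (initial point mutation, then recursive rounds of shuffling, screening and pooling) can be sketched in silico as follows. This is a toy model under stated assumptions, not a laboratory protocol: recombination is reduced to a single crossover between equal-length sequences, the screening assay is a caller-supplied `score` function, and all names and default values (`evolve`, `pool_size`, `keep`, `rate`) are hypothetical.

```python
import random


def point_mutate(seq, rate, rng, alphabet="ACGT"):
    """Replace each base with a randomly drawn base with probability `rate`."""
    return "".join(
        rng.choice(alphabet) if rng.random() < rate else base for base in seq
    )


def shuffle_pair(a, b, rng):
    """Single-crossover recombination between two equal-length parents."""
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:]


def evolve(parent, score, rounds=5, pool_size=40, keep=8, rate=0.05, seed=0):
    """Point-mutate once, then run recursive rounds of screening and shuffling.

    Returns the best sequence found and the best score seen in each round.
    """
    rng = random.Random(seed)
    # Initial diversification: the parent plus point-mutated copies.
    pool = [parent] + [point_mutate(parent, rate, rng) for _ in range(pool_size - 1)]
    history = []
    for _ in range(rounds):
        # Screening: keep the top-scoring "candidate" sequences.
        selected = sorted(pool, key=score, reverse=True)[:keep]
        history.append(score(selected[0]))
        # Shuffling: recombine the pooled candidates to refill the library.
        children = [
            shuffle_pair(rng.choice(selected), rng.choice(selected), rng)
            for _ in range(pool_size - keep)
        ]
        pool = selected + children
    return max(pool, key=score), history


if __name__ == "__main__":
    # Hypothetical screen: count G residues as a stand-in for an activity assay.
    best, history = evolve("ACGT" * 5, lambda s: s.count("G"))
    print(best, history)
```

Because the selected candidates are carried into the next pool unchanged, the best score in this sketch never decreases from one round to the next; a real screen is noisier and offers no such guarantee.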

III. Glycosidases and Glycosyltransferases
A. Glycosidases 

Glycosidases are glycosyltransferases that use water as an acceptor molecule, and as
such, are typically glycoside-hydrolytic enzymes. Glycosidases can be used for the formation
of glycosidic bonds in vitro by controlling the thermodynamics or kinetics of the reaction
mixture. Even with modified reaction conditions, though, glycosidase reactions can be
difficult to work with, and glycosidases tend to give low synthetic yields as a result of the
reversible transglycosylase reaction and the competing hydrolytic reaction.

A glycosidase can function by retaining the stereochemistry at the bond being broken
during hydrolysis or by inverting the stereochemistry at the bond being broken during
hydrolysis, classifying the glycosidase as either a "retaining" glycosidase or an "inverting"
glycosidase, respectively. Retaining glycosidases have two critical carboxylic acid moieties
present in the active site, with one carboxylate acting as an acid/base catalyst and the other as
a nucleophile, whereas with the inverting glycosidases, one carboxylic acid functions as an
acid and the other functions as a base.

Methods to determine the activity and linkage specificity of any glycosidase are well
known in the art, including a simplified HPLC protocol (Jacob and Scudder, 1994, Methods
in Enzymol. 230: 280-300). A general discussion of glycosidases and glycosidase treatment
is found in Glycobiology, A Practical Approach (1993, Fukuda and Kobata eds., Oxford
University Press Inc., New York).

Glycosidases useful in the invention include, but are not limited to, sialidase, galactosidase,
endoglucanase, mannosidase (i.e., α and β, ManI, ManII and ManIII), xylosidase, fucosidase,
Agrobacterium sp. β-glucosidase, Cellulomonas fimi mannosidase 2A, Humicola insolens
glycosidase, Sulfolobus solfataricus glycosidase and Bacillus licheniformis glycosidase.

The choice of fucosidases for use in the invention depends on the linkage of the
fucose to other molecules. The specificities of many α-fucosidases useful in the methods of
the invention are well known to those in the art, and many varieties of fucosidase are also
commercially available (Glyko, Novato, CA; PROzyme, San Leandro, CA; Calbiochem-
Novabiochem Corp., San Diego, CA; among others). α-Fucosidases of interest include, but
are not limited to, α-fucosidases from Turbo cornutus, Charonia lampas, Bacillus fulminans,
Aspergillus niger, Clostridium perfringens, bovine kidney (Glyko), chicken liver (Tyagarajan
et al., 1996, Glycobiology 6:83-93) and α-fucosidase II from Xanthomonas manihotis (Glyko,
PROzyme). Chicken liver fucosidase is particularly useful for removal of core fucose from
N-linked glycans.

B. Glycosyltransferases

Glycosyltransferases catalyze the addition of activated sugars (donor NDP-sugars), in
a step-wise fashion, to a protein, glycopeptide, lipid or glycolipid or to the non-reducing end
of a growing oligosaccharide. N-linked glycopeptides are synthesized via a transferase and a
lipid-linked oligosaccharide donor Dol-PP-NAG2Glc3Man9 in an en bloc transfer followed
by trimming of the core. In this case the nature of the "core" saccharide is somewhat
different from subsequent attachments. A very large number of glycosyltransferases are
known in the art.

The glycosyltransferase to be used in the present invention may be any as long as it
can utilize the modified sugar as a sugar donor. Examples of such enzymes include Leloir
pathway glycosyltransferases, such as galactosyltransferase, N-acetylglucosaminyltransferase,
N-acetylgalactosaminyltransferase, fucosyltransferase, sialyltransferase, mannosyltransferase,
xylosyltransferase, glucurononyltransferase and the like.

For enzymatic saccharide syntheses that involve glycosyltransferase reactions,
glycosyltransferase can be cloned, or isolated from any source. Many cloned
glycosyltransferases are known, as are their polynucleotide sequences. See, e.g., Taniguchi
et al., 2002, Handbook of glycosyltransferases and related genes, Springer, Tokyo.

Glycosyltransferase amino acid sequences and nucleotide sequences encoding
glycosyltransferases from which the amino acid sequences can be deduced are also found in
various publicly available databases, including GenBank, Swiss-Prot, EMBL, and others.

Glycosyltransferases that can be employed in the methods of the invention include,
but are not limited to, galactosyltransferases, fucosyltransferases, glucosyltransferases,
N-acetylgalactosaminyltransferases, N-acetylglucosaminyltransferases, glucuronyltransferases,
sialyltransferases, mannosyltransferases, glucuronic acid transferases, galacturonic acid
transferases, and oligosaccharyltransferases. Suitable glycosyltransferases include those
obtained from eukaryotes, as well as from prokaryotes.

DNA encoding glycosyltransferases may be obtained by chemical synthesis, by
screening reverse transcripts of mRNA from appropriate cells or cell line cultures, by
screening genomic libraries from appropriate cells, or by combinations of these procedures.
Screening of mRNA or genomic DNA may be carried out using oligonucleotide probes
generated from the glycosyltransferase nucleic acid sequence. Probes may be labeled with a
detectable label, such as, but not limited to, a fluorescent group, a radioactive atom or a
chemiluminescent group in accordance with known procedures and used in conventional
hybridization assays. In the alternative, glycosyltransferase nucleic acid sequences may be
obtained by use of the polymerase chain reaction (PCR) procedure, with the PCR
oligonucleotide primers being produced from the glycosyltransferase nucleic acid sequence.
See, U.S. Pat. No. 4,683,195 to Mullis et al. and U.S. Pat. No. 4,683,202 to Mullis.
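
As a minimal illustration of how a pair of PCR primers is derived from a known nucleic acid sequence, the sketch below takes the first bases of the template as the forward primer and the reverse complement of the last bases as the reverse primer. The function names, the fixed primer length and the example template are assumptions made for illustration; practical primer design also weighs melting temperature, GC content and secondary structure, none of which is modeled here.

```python
# Translation table for complementing DNA bases (upper or lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def flanking_primers(template, length=20):
    """Return (forward, reverse) primers that flank the whole template.

    The forward primer matches the 5' end of the given strand; the reverse
    primer is the reverse complement of its 3' end, so that both primers
    read 5' to 3' and point toward each other.
    """
    if len(template) < 2 * length:
        raise ValueError("template is shorter than two primers")
    forward = template[:length]
    reverse = reverse_complement(template[-length:])
    return forward, reverse


if __name__ == "__main__":
    # Hypothetical 21-base template, primers of 6 bases for readability.
    print(flanking_primers("ATGGCCAAATTTGGGCCCTAA", length=6))
```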

A glycosyltransferase enzyme may be synthesized in a host cell transformed with a
vector containing DNA encoding the glycosyltransferase enzyme. A vector is a replicable
DNA construct. Vectors are used either to amplify DNA encoding the glycosyltransferase
enzyme and/or to express DNA which encodes the glycosyltransferase enzyme. An
expression vector is a replicable DNA construct in which a DNA sequence encoding the
glycosyltransferase enzyme is operably linked to suitable control sequences capable of
effecting the expression of the glycosyltransferase enzyme in a suitable host. The need for
such control sequences will vary depending upon the host selected and the transformation
method chosen. Generally, control sequences include a transcriptional promoter, an optional
operator sequence to control transcription, a sequence encoding suitable mRNA ribosomal
binding sites, and sequences which control the termination of transcription and translation.
Amplification vectors do not require expression control domains. All that is needed is the
ability to replicate in a host, usually conferred by an origin of replication, and a selection
gene to facilitate recognition of transformants.
1. Fucosyltransferases
In some embodiments, a glycosyltransferase used in the method of the invention is a
fucosyltransferase. Fucosyltransferases are known to those of skill in the art. Exemplary
fucosyltransferases include enzymes which transfer L-fucose from GDP-fucose to a hydroxy
position of an acceptor sugar. Fucosyltransferases that transfer from non-nucleotide sugars to
an acceptor are also of use in the present invention.

In some embodiments, the acceptor sugar is, for example, the GlcNAc in a
Galβ(1→3,4)GlcNAcβ- group in an oligosaccharide glycoside. Suitable fucosyltransferases
for this reaction include the Galβ(1→3,4)GlcNAcβ1-α(1→3,4)fucosyltransferase (FTIII E.C.
No. 2.4.1.65), which was first characterized from human milk (see, Palcic, et al.,
Carbohydrate Res. 190: 1-11 (1989); Prieels, et al., J. Biol. Chem. 256: 10456-10463 (1981);
and Nunez, et al., Can. J. Chem. 59: 2086-2095 (1981)) and the Galβ(1→4)GlcNAcβ-
αfucosyltransferases (FTIV, FTV, FTVI) which are found in human serum. FTVII (E.C. No.
2.4.1.65), a sialyl α(2→3)Galβ(1→3)GlcNAcβ fucosyltransferase, has also been
characterized. A recombinant form of the Galβ(1→3,4)GlcNAcβ-
α(1→3,4)fucosyltransferase has also been characterized (see, Dumas, et al., Bioorg. Med.
Letters 1: 425-428 (1991) and Kukowska-Latallo, et al., Genes and Development 4: 1288-
1303 (1990)). Other exemplary fucosyltransferases include, for example, α1,2
fucosyltransferase (E.C. No. 2.4.1.69). Enzymatic fucosylation can be carried out by the
methods described in Mollicone, et al., Eur. J. Biochem. 191: 169-176 (1990) or U.S. Patent
No. 5,374,655.

2. Galactosyltransferases

In another group of embodiments, the glycosyltransferase is a galactosyltransferase.
Exemplary galactosyltransferases include α(1,3) galactosyltransferases (E.C. No. 2.4.1.151,
see, e.g., Dabkowski et al., Transplant Proc. 25:2921 (1993) and Yamamoto et al., Nature 345:
229-233 (1990), bovine (GenBank j04989, Joziasse et al., J. Biol. Chem. 264: 14290-14297
(1989)), murine (GenBank m26925; Larsen et al., Proc. Nat'l. Acad. Sci. USA 86: 8227-8231
(1989)), porcine (GenBank L36152; Strahan et al., Immunogenetics 41: 101-105 (1995))).
Another suitable α1,3 galactosyltransferase is that which is involved in synthesis of the blood
group B antigen (EC 2.4.1.37, Yamamoto et al., J. Biol. Chem. 265: 1146-1151 (1990)
(human)).

Also suitable for use in the methods of the invention are β(1,4) galactosyltransferases,
which include, for example, EC 2.4.1.90 (LacNAc synthetase) and EC 2.4.1.22 (lactose
synthetase) (bovine (D'Agostaro et al., Eur. J. Biochem. 183: 211-217 (1989)), human (Masri
et al., Biochem. Biophys. Res. Commun. 157: 657-663 (1988)), murine (Nakazawa et al., J.
Biochem. 104: 165-168 (1988))), as well as E.C. 2.4.1.38 and the ceramide
galactosyltransferase (EC 2.4.1.45, Stahl et al., J. Neurosci. Res. 38: 234-242 (1994)). Other
suitable galactosyltransferases include, for example, α1,2 galactosyltransferases (from e.g.,
Schizosaccharomyces pombe, Chapell et al., Mol. Biol. Cell 5: 519-528 (1994)). For further
suitable galactosyltransferases, see Taniguchi et al. (2002, Handbook of Glycosyltransferases
and Related Genes, Springer, Tokyo), Guo et al. (2001, Glycobiology, 11(10):813-820), and
Breton et al. (1998, J. Biochem. 123:1000-1009).

The production of proteins such as the enzyme GalNAc T I-XIV from cloned genes by
genetic engineering is well known. See, e.g., U.S. Pat. No. 4,761,371. One method involves
collection of sufficient samples, then the amino acid sequence of the enzyme is determined
by N-terminal sequencing. This information is then used to isolate a cDNA clone encoding a
full-length (membrane bound) transferase which upon expression in the insect cell line Sf9
resulted in the synthesis of a fully active enzyme. The acceptor specificity of the enzyme is
then determined using a semiquantitative analysis of the amino acids surrounding known
glycosylation sites in 16 different proteins followed by in vitro glycosylation studies of
synthetic peptides. This work has demonstrated that certain amino acid residues are
overrepresented in glycosylated peptide segments and that residues in specific positions
surrounding glycosylated serine and threonine residues may have a more marked influence on
acceptor efficiency than other amino acid moieties.
3. Sialyltransferases
Sialyltransferases are another type of glycosyltransferase that is useful in the
recombinant cells and reaction mixtures of the invention. Examples of sialyltransferases that
are suitable for use in the present invention include ST3Gal III (e.g., a rat or human ST3Gal
III), ST3Gal IV, ST3Gal I, ST6Gal I, ST3Gal V, ST6Gal II, ST6GalNAc I, ST6GalNAc II,
and ST6GalNAc III (the sialyltransferase nomenclature used herein is as described in Tsuji et
al., Glycobiology 6: v-xiv (1996)). An exemplary α(2,3)sialyltransferase referred to as
α(2,3)sialyltransferase (EC 2.4.99.6) transfers sialic acid to the non-reducing terminal Gal of
a Galβ1→3Glc disaccharide or glycoside. See, Van den Eijnden et al., J. Biol. Chem. 256:
3159 (1981), Weinstein et al., J. Biol. Chem. 257: 13845 (1982) and Wen et al., J. Biol.
Chem. 267: 21011 (1992). Another exemplary α2,3-sialyltransferase (EC 2.4.99.4) transfers
sialic acid to the non-reducing terminal Gal of the disaccharide or glycoside, see, Rearick et
al., J. Biol. Chem. 254: 4444 (1979) and Gillespie et al., J. Biol. Chem. 267: 21004 (1992).
Further exemplary enzymes include Gal-β-1,4-GlcNAc α-2,6 sialyltransferase (See,
Kurosawa et al., Eur. J. Biochem. 219: 375-381 (1994)).

Preferably, for glycosylation of carbohydrates of glycopeptides the sialyltransferase
will be able to transfer sialic acid to the sequence Galβ1,4GlcNAc-, Galβ1,3GlcNAc-, or
Galβ1,3GalNAc-, the most common penultimate sequences underlying the terminal sialic
acid on fully sialylated carbohydrate structures (see, Table 7). 2,8-Sialyltransferases capable
of transferring sialic acid to α2,3Galβ1,4GlcNAc are also useful in the methods of the
invention.

Table 7. Sialyltransferases which use the Galβ1,4GlcNAc sequence as
an acceptor substrate

Sialyltransferase   Source             Sequence(s) formed           Ref.
ST6Gal I            Mammalian          NeuAcα2,6Galβ1,4GlcNAc-      1
ST3Gal III          Mammalian          NeuAcα2,3Galβ1,4GlcNAc-      1
                                       NeuAcα2,3Galβ1,3GlcNAc-
ST3Gal IV           Mammalian          NeuAcα2,3Galβ1,4GlcNAc-      1
                                       NeuAcα2,3Galβ1,3GlcNAc-
ST6Gal II           Mammalian          NeuAcα2,6Galβ1,4GlcNAc-
ST6Gal II           Photobacterium     NeuAcα2,6Galβ1,4GlcNAc-      2
ST3Gal V            N. meningitidis    NeuAcα2,3Galβ1,4GlcNAc-      3
                    N. gonorrhoeae

1) Goochee et al., Bio/Technology 9: 1347-1355 (1991)
2) Yamamoto et al., J. Biochem. 120: 104-110 (1996)
3) Gilbert et al., J. Biol. Chem. 271: 28271-28276 (1996)



An example of a sialyltransferase that is useful in the claimed methods is ST3Gal III,
which is also referred to as α(2,3) sialyltransferase (EC 2.4.99.6). This enzyme catalyzes the
transfer of sialic acid to the Gal of a Galβ1,3GlcNAc or Galβ1,4GlcNAc glycoside (see, e.g.,
Wen et al., J. Biol. Chem. 267: 21011 (1992); Van den Eijnden et al., J. Biol. Chem. 256:
3159 (1981)) and is responsible for sialylation of asparagine-linked oligosaccharides in
glycopeptides. The sialic acid is linked to a Gal with the formation of an α-linkage between
the two saccharides. Bonding (linkage) between the saccharides is between the 2-position of
NeuAc and the 3-position of Gal. This particular enzyme can be isolated from rat liver
(Weinstein et al., J. Biol. Chem. 257: 13845 (1982)); the human cDNA (Sasaki et al. (1993)
J. Biol. Chem. 268: 22782-22787; Kitagawa & Paulson (1994) J. Biol. Chem. 269: 1394-
1401) and genomic (Kitagawa et al. (1996) J. Biol. Chem. 271: 931-938) DNA sequences are
known, facilitating production of this enzyme by recombinant expression. In a preferred
embodiment, the claimed sialylation methods use a rat ST3Gal III.

Other exemplary sialyltransferases of use in the present invention include those
isolated from Campylobacter jejuni, including the α(2,3). See, e.g., WO99/49051.

Other sialyltransferases, including those listed in Table 7, are also useful in an
economic and efficient large-scale process for sialylation of commercially important
glycopeptides. As a simple test to find out the utility of these other enzymes, various
amounts of each enzyme (1-100 mU/mg protein) are reacted with asialo-α1 AGP (at 1-10
mg/ml) to compare the ability of the sialyltransferase of interest to sialylate glycopeptides
relative to either bovine ST6Gal I, ST3Gal III or both sialyltransferases. Alternatively, other
glycopeptides, or N-linked oligosaccharides enzymatically released from the
peptide backbone, can be used in place of asialo-α1 AGP for this evaluation.
Sialyltransferases with the ability to sialylate N-linked oligosaccharides of glycopeptides
more efficiently than ST6Gal I are useful in a practical large-scale process for peptide
sialylation (as illustrated for ST3Gal III in this disclosure).
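
One way to tabulate the outcome of the comparison test described above is sketched here. The code assumes that sialic acid incorporation has already been measured for the candidate enzyme and for a reference enzyme (for example ST6Gal I) at matching enzyme doses; the function names, the dictionary layout and the numbers in the usage example are all hypothetical and are not data from this disclosure.

```python
def relative_sialylation(candidate, reference):
    """Ratio of candidate to reference incorporation at each shared dose.

    Both arguments map an enzyme dose (mU/mg protein) to a measured amount
    of sialic acid incorporated, in the same units for both enzymes.
    """
    shared = sorted(set(candidate) & set(reference))
    return {
        dose: candidate[dose] / reference[dose]
        for dose in shared
        if reference[dose] > 0  # skip doses with no reference signal
    }


def outperforms(candidate, reference):
    """True if the candidate incorporates more sialic acid at every shared dose."""
    ratios = relative_sialylation(candidate, reference)
    return bool(ratios) and all(ratio > 1.0 for ratio in ratios.values())


if __name__ == "__main__":
    # Hypothetical readings at 1, 10 and 100 mU/mg protein.
    candidate = {1: 0.2, 10: 0.6, 100: 0.9}
    reference = {1: 0.1, 10: 0.4, 100: 0.8}
    print(relative_sialylation(candidate, reference))
    print(outperforms(candidate, reference))
```

Requiring a ratio above one at every dose is a deliberately strict criterion; a less conservative screen might compare only the dose range intended for the large-scale process.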
4. Other glycosyltransferases

One of skill in the art will understand that other glycosyltransferases can be
substituted into similar transferase cycles as have been described in detail for the
sialyltransferase. In particular, the glycosyltransferase can also be, for instance,
glucosyltransferases, e.g., Alg8 (Stagljar et al., Proc. Natl. Acad. Sci. USA 91: 5977 (1994))
or Alg5 (Heesen et al., Eur. J. Biochem. 224: 71 (1994)).

N-acetylgalactosaminyltransferases are also of use in practicing the present invention.
Suitable N-acetylgalactosaminyltransferases include, but are not limited to, α(1,3)
N-acetylgalactosaminyltransferase, β(1,4) N-acetylgalactosaminyltransferases (Nagata et al., J.
Biol. Chem. 267: 12082-12089 (1992) and Smith et al., J. Biol. Chem. 269: 15162 (1994))
and peptide N-acetylgalactosaminyltransferase (Homa et al., J. Biol. Chem. 268: 12609
(1993)). Suitable N-acetylglucosaminyltransferases include GnTI (2.4.1.101, Hull et al.,
BBRC 176: 608 (1991)), GnTII, GnTIII (Ihara et al., J. Biochem. 113: 692 (1993)), GnTIV,
GnTV (Shoreibah et al., J. Biol. Chem. 268: 15381 (1993)) and GnTVI, O-linked
N-acetylglucosaminyltransferase (Bierhuizen et al., Proc. Natl. Acad. Sci. USA 89: 9326
(1992)), N-acetylglucosamine-1-phosphate transferase (Rajput et al., Biochem J. 285: 985
(1992)), and hyaluronan synthase.

Mannosyltransferases are of use to transfer modified mannose moieties. Suitable
mannosyltransferases include α(1,2) mannosyltransferase, α(1,3) mannosyltransferase, α(1,6)
mannosyltransferase, β(1,4) mannosyltransferase, Dol-P-Man synthase, OCh1, and Pmt1
(see, Kornfeld et al., Annu. Rev. Biochem. 54: 631-664 (1985)).

Xylosyltransferases are also useful in the present invention. See, for example,
Rodgers, et al., Biochem. J., 288:817-822 (1992); and Elbein, et al., U.S. Patent No.
6,168,937.

Other suitable glycosyltransferase cycles are described in Ichikawa et al., JACS 114:
9283 (1992), Wong et al., J. Org. Chem. 57: 4343 (1992), and Ichikawa et al. in
Carbohydrates and Carbohydrate Polymers, Yalpani, ed. (ATL Press, 1993).

Prokaryotic glycosyltransferases are also useful in practicing the invention. Such
glycosyltransferases include enzymes involved in synthesis of lipooligosaccharides (LOS),
which are produced by many gram negative bacteria. The LOS typically have terminal
glycan sequences that mimic glycoconjugates found on the surface of human epithelial cells
or in host secretions (Preston et al., Critical Reviews in Microbiology 23(3): 139-180 (1996)).
Such enzymes include, but are not limited to, the proteins of the rfa operons of species such
as E. coli and Salmonella typhimurium, which include a β1,6 galactosyltransferase and a β1,3
galactosyltransferase (see, e.g., EMBL Accession Nos. M80599 and M86935 (E. coli);
EMBL Accession No. S56361 (S. typhimurium)), a glucosyltransferase (Swiss-Prot
Accession No. P25740 (E. coli)), a β1,2-glucosyltransferase (rfaJ) (Swiss-Prot Accession No.
P27129 (E. coli) and Swiss-Prot Accession No. P19817 (S. typhimurium)), and a β1,2-N-
acetylglucosaminyltransferase (rfaK) (EMBL Accession No. U00039 (E. coli)). Other
glycosyltransferases for which amino acid sequences are known include those that are
encoded by operons such as rfaB, which have been characterized in organisms such as
Klebsiella pneumoniae, E. coli, Salmonella typhimurium, Salmonella enterica, Yersinia
enterocolitica, Mycobacterium leprosum, and the rhl operon of Pseudomonas aeruginosa.

Also suitable for use in the present invention are glycosyltransferases that are
involved in producing structures containing lacto-N-neotetraose, D-galactosyl-β-1,4-N-
acetyl-D-glucosaminyl-β-1,3-D-galactosyl-β-1,4-D-glucose, and the Pk blood group
trisaccharide sequence, D-galactosyl-α-1,4-D-galactosyl-β-1,4-D-glucose, which have been
identified in the LOS of the mucosal pathogens Neisseria gonorrhoeae and N. meningitidis
(Scholten et al., J. Med. Microbiol. 41: 236-243 (1994)). The genes from N. meningitidis and
N. gonorrhoeae that encode the glycosyltransferases involved in the biosynthesis of these
structures have been identified from N. meningitidis immunotypes L3 and L1 (Jennings et al.,
Mol. Microbiol. 18: 729-740 (1995)) and the N. gonorrhoeae mutant F62 (Gotschlich, J. Exp.
Med. 180: 2181-2190 (1994)). In N. meningitidis, a locus consisting of three genes, lgtA,
lgtB and lgtE, encodes the glycosyltransferase enzymes required for addition of the last three
of the sugars in the lacto-N-neotetraose chain (Wakarchuk et al., J. Biol. Chem. 271: 19166-
73 (1996)). Recently the enzymatic activity of the lgtB and lgtA gene products was
demonstrated, providing the first direct evidence for their proposed glycosyltransferase
function (Wakarchuk et al., J. Biol. Chem. 271(45): 28271-276 (1996)). In N. gonorrhoeae,
there are two additional genes, lgtD which adds β-D-GalNAc to the 3 position of the terminal
galactose of the lacto-N-neotetraose structure and lgtC which adds a terminal α-D-Gal to the
lactose element of a truncated LOS, thus creating the blood group antigen structure
(Gotschlich (1994), supra.). In N. meningitidis, a separate immunotype L1 also expresses the
blood group antigen and has been shown to carry an lgtC gene (Jennings et al., (1995),
supra.). Neisseria glycosyltransferases and associated genes are also described in USPN
5,545,553 (Gotschlich). Genes for α1,2-fucosyltransferase and α1,3-fucosyltransferase from
Helicobacter pylori have also been characterized (Martin et al., J. Biol. Chem. 272: 21349-
21356 (1997)). Also of use in the present invention are the glycosyltransferases of
Campylobacter jejuni (see, Taniguchi et al., 2002, Handbook of glycosyltransferases and
related genes, Springer, Tokyo).
B. Sulfotransferases 

The invention also provides methods for producing peptides that include sulfated
molecules, including, for example sulfated polysaccharides such as heparin, heparan sulfate,
carrageenan, and related compounds. Suitable sulfotransferases include, for example,
chondroitin-6-sulphotransferase (chicken cDNA described by Fukuta et al., J. Biol. Chem.
270: 18575-18580 (1995); GenBank Accession No. D49915), glycosaminoglycan
N-acetylglucosamine N-deacetylase/N-sulphotransferase 1 (Dixon et al., Genomics 26: 239-241
(1995); UL18918), and glycosaminoglycan N-acetylglucosamine N-deacetylase/N-
sulphotransferase 2 (murine cDNA described in Orellana et al., J. Biol. Chem. 269: 2270-
2276 (1994) and Eriksson et al., J. Biol. Chem. 269: 10438-10443 (1994); human cDNA
described in GenBank Accession No. U2304).
C. Cell-Bound Glycosyltransferases

In another embodiment, the enzymes utilized in the method of the invention are
cell-bound glycosyltransferases. Although many soluble glycosyltransferases are known (see, for
example, U.S. Pat. No. 5,032,519), glycosyltransferases are generally in membrane-bound
form when associated with cells. Many of the membrane-bound enzymes studied thus far are
considered to be intrinsic proteins; that is, they are not released from the membranes by
sonication and require detergents for solubilization. Surface glycosyltransferases have been
identified on the surfaces of vertebrate and invertebrate cells, and it has also been recognized
that these surface transferases maintain catalytic activity under physiological conditions.
However, the more recognized function of cell surface glycosyltransferases is for intercellular
recognition (Roth, 1990, Molecular Approaches to Supracellular Phenomena).

Methods have been developed to alter the glycosyltransferases expressed by cells.
For example, Larsen et al., Proc. Natl. Acad. Sci. USA 86: 8227-8231 (1989), report a
genetic approach to isolate cloned cDNA sequences that determine expression of cell surface
oligosaccharide structures and their cognate glycosyltransferases. A cDNA library generated
from mRNA isolated from a murine cell line known to express UDP-galactose:β-D-
galactosyl-1,4-N-acetyl-D-glucosaminide α-1,3-galactosyltransferase was transfected into
COS-1 cells. The transfected cells were then cultured and assayed for α1-3
galactosyltransferase activity.

Francisco et al., Proc. Natl. Acad. Sci. USA 89: 2713-2717 (1992), disclose a method
of anchoring β-lactamase to the external surface of Escherichia coli. A tripartite fusion
consisting of (i) a signal sequence of an outer membrane protein, (ii) a membrane-spanning
section of an outer membrane protein, and (iii) a complete mature β-lactamase sequence is
produced, resulting in an active surface bound β-lactamase molecule. However, the Francisco
method is limited only to prokaryotic cell systems and, as recognized by the authors, requires
the complete tripartite fusion for proper functioning.

D. Fusion Enzymes

In other exemplary embodiments, the methods of the invention utilize fusion peptides
that have more than one enzymatic activity that is involved in synthesis of a desired
glycopeptide conjugate. The fusion peptides can be composed of, for example, a catalytically
active domain of a glycosyltransferase that is joined to a catalytically active domain of an
accessory enzyme. The accessory enzyme catalytic domain can, for example, catalyze a step
in the formation of a nucleotide sugar that is a donor for the glycosyltransferase, or catalyze a
reaction involved in a glycosyltransferase cycle. For example, a polynucleotide that encodes
a glycosyltransferase can be joined, in-frame, to a polynucleotide that encodes an enzyme
involved in nucleotide sugar synthesis. The resulting fusion peptide can then catalyze not
only the synthesis of the nucleotide sugar, but also the transfer of the sugar moiety to the
acceptor molecule. The fusion peptide can be two or more cycle enzymes linked into one
expressible nucleotide sequence. In other embodiments the fusion peptide includes the
catalytically active domains of two or more glycosyltransferases. See, for example, U.S.
Patent No. 5,641,668. The modified glycopeptides of the present invention can be readily
designed and manufactured utilizing various suitable fusion peptides (see, for example, PCT
Patent Application PCT/CA98/01180, which was published as WO 99/31224 on June 24,
1999).
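
The requirement that the two polynucleotides be joined in-frame can be checked computationally before a construct is made. The following sketch concatenates two coding sequences, with an optional linker, after removing the stop codon of the upstream sequence and confirming that no part shifts the reading frame. The function name `join_in_frame`, the glycine-glycine linker and the short example sequences are illustrative assumptions only.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_in_frame(upstream, downstream, linker=""):
    """Concatenate two coding sequences so the downstream one stays in frame.

    Each part must be a whole number of codons. A stop codon at the end of
    the upstream sequence is removed so translation reads through into the
    linker and the downstream sequence.
    """
    parts = (("upstream", upstream), ("linker", linker), ("downstream", downstream))
    for name, part in parts:
        if len(part) % 3 != 0:
            raise ValueError(f"{name} length is not a multiple of three")
    if upstream[-3:].upper() in STOP_CODONS:
        upstream = upstream[:-3]
    fused = upstream + linker + downstream
    # Every codon except the last must be a sense codon.
    internal = [fused[i:i + 3].upper() for i in range(0, len(fused) - 3, 3)]
    if any(codon in STOP_CODONS for codon in internal):
        raise ValueError("fusion contains an internal stop codon")
    return fused


if __name__ == "__main__":
    # Hypothetical fragments: Met-Ala-stop joined to Met-Lys-stop via Gly-Gly.
    print(join_in_frame("ATGGCTTAA", "ATGAAATGA", linker="GGTGGT"))
```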

E. Immobilized Enzymes

In addition to cell-bound enzymes, the present invention also provides for the use of
enzymes that are immobilized on a solid and/or soluble support. In an exemplary
embodiment, there is provided a glycosyltransferase that is conjugated to a PEG via an intact
glycosyl linker according to the methods of the invention. The PEG-linker-enzyme conjugate
is optionally attached to solid support. The use of solid supported enzymes in the methods of
the invention simplifies the work up of the reaction mixture and purification of the reaction
product, and also enables the facile recovery of the enzyme. The glycosyltransferase
conjugate is utilized in the methods of the invention. Other combinations of enzymes and
supports will be apparent to those of skill in the art.
. F. Mutagenesis of Glycosyltransferases 

The novel forms of the glycosyltransferases, sialyltransferases, sulfotransferases, and
any other enzymes used in the method of the invention can be created using any of the
methods described previously, as well as others well known to those in the art. Of particular
interest are transferases with altered acceptor specificity and/or donor specificity. Also of
interest are enzymes with higher conversion rates and higher stability, among others.

The techniques of rational design mutagenesis can be used when the sequence of the
peptide is known. Since the sequences as well as many of the tertiary structures of the
transferases and glucosidases used in the invention are known, these enzymes are ideal for
rational design of mutants. For example, the catalytic site of the enzyme can be mutated to
alter the donor and/or acceptor specificity of the enzyme.

The extensive tertiary structural data on the glycosyltransferases and glycosidase
hydrolases also make these enzymes ideal for mutations involving domain exchanges.
Glycosyltransferases and glycosidase hydrolases are modular enzymes (see Bourne and
Henrissat, 2001, Current Opinion in Structural Biology 11:593-600). Glycosyltransferases
are divided into two families based on their structure: GT-A and GT-B. The
glycosyltransferases of the GT-A family comprise two dissimilar domains, one involved in
nucleotide binding and the other in acceptor binding. Thus, one could conveniently fuse the
DNA sequence encoding the domain from one gene in frame with a domain from a second
gene to create a new gene that encodes a protein with a new acceptor/donor specificity. Such
exchanges of domains could additionally include the carbohydrate modules and other
accessory domains.

The techniques of random mutation and/or directed evolution, as described above,
may also be used to create novel forms of the glycosyltransferases and glycosidases used in
the invention.

IV. In vitro and in vivo expression systems 

A. Cells for the production of glycopeptides

The action of glycosyltransferases is key to the glycosylation of peptides; thus, the
difference in the expression of a set of glycosyltransferases in any given cell type affects the
pattern of glycosylation on any given peptide produced in that cell. For a review of host cell
dependent glycosylation of peptides, see Kabata and Takasaki, "Structure and Biosynthesis of
Cell Surface Carbohydrates," in Cell Surface Carbohydrates and Cell Development, 1991, pp.
1-24, Eds. Minoru Fukuda, CRC Press, Boca Raton, FL.

According to the present disclosure, the type of cell in which the peptide is produced
is relevant only with respect to the degree of remodeling required to generate a peptide
having desired glycosylation. For example, the number and sequence of enzymatic digestion
reactions and the number and sequence of enzymatic synthetic reactions that are required in
vitro to generate a peptide having desired glycosylation will vary depending on the structure
of the glycan on the peptide produced by a particular cell type. While the invention should in
no way be construed to be limited to the production of peptides from any one particular cell
type, including any cell type disclosed herein, a discussion of several cell systems is now
presented which establishes the power of the present invention and its independence of the
cell type in which the peptides are generated.

In general, to express a peptide from a nucleic acid encoding it, the nucleic acid
must be incorporated into an expression cassette, comprising a promoter element, a
terminator element, and the coding sequence of the peptide operably linked between the two.
The expression cassette is then operably linked into a vector. Toward this end, adapters or
linkers may be employed to join the nucleotide fragments, or other manipulations may be
involved to provide for convenient restriction sites, removal of superfluous nucleotides,
removal of restriction sites, or the like. For this purpose, in vitro mutagenesis, primer repair,
restriction, annealing, and resubstitutions, e.g., transitions and transversions, may be involved. A
shuttle vector has the genetic elements necessary for replication in a cell. Some vectors may
be replicated only in prokaryotes, or may be replicated in both prokaryotes and eukaryotes.
Such a plasmid expression vector will be maintained in one or more replication systems,
preferably two replication systems, that allow for stable maintenance within a yeast host cell
for expression purposes, and within a prokaryotic host for cloning purposes. Many vectors
with diverse characteristics are now available commercially. Vectors are usually plasmids or
phages, but may also be cosmids or mini-chromosomes. Conveniently, many commercially
available vectors will have the promoter and terminator of the expression cassette already
present, and a multi-linker site where the coding sequence for the peptide of interest can be
inserted. The shuttle vector containing the expression cassette is then transformed into E. coli
where it is replicated during cell division to generate a preparation of vector that is sufficient
to transform the host cells of the chosen expression system. The above methodology is well
known to those in the art, and protocols by which to accomplish it can be found in Sambrook et al.
(2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New
York).

WO 03/031464 PCT/US02/32263
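
By way of illustration only, the relationship between an expression cassette and a shuttle vector can be sketched as a small Python data model. The class names, field names, host labels, and element names used here are hypothetical placeholders chosen for the example and do not describe any particular commercial vector.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional


@dataclass(frozen=True)
class ExpressionCassette:
    """Promoter, coding sequence, and terminator, in 5' to 3' order."""
    promoter: str
    coding_sequence: str
    terminator: str

    def layout(self) -> str:
        # The coding sequence sits between the promoter and the terminator.
        return f"{self.promoter} > {self.coding_sequence} > {self.terminator}"


@dataclass(frozen=True)
class Vector:
    """A vector and the hosts in which it can replicate (hypothetical model)."""
    name: str
    replication_hosts: FrozenSet[str]
    cassette: Optional[ExpressionCassette] = None

    def with_cassette(self, cassette: ExpressionCassette) -> "Vector":
        # Return a new vector carrying the cassette at its multi-linker site.
        return Vector(self.name, self.replication_hosts, cassette)

    def can_shuttle(self, cloning_host: str, expression_host: str) -> bool:
        # A shuttle vector must be maintained in both the cloning and the expression host.
        return {cloning_host, expression_host} <= self.replication_hosts
```

A vector modeled with replication hosts `{"E. coli", "yeast"}` would satisfy `can_shuttle("E. coli", "yeast")`, which corresponds to the two replication systems preferred in the paragraph above.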

The vector, once purified from the cells in which it is amplified, is then transformed
into the cells of the expression system. The protocol for transformation depends on the kind
of the cell and the nature of the vector. Transformants are grown in an appropriate nutrient
medium, and, where appropriate, maintained under selective pressure to ensure retention of
endogenous DNA. Where expression is inducible, growth can be permitted of the yeast host
to yield a high density of cells, and then expression is induced. The secreted, mature
heterologous peptide can be harvested by any conventional means, and purified by
chromatography, electrophoresis, dialysis, solvent-solvent extraction, and the like.

The techniques of molecular cloning are well known in the art. Further, techniques
for the procedures of molecular cloning can be found in Sambrook et al. (2001, Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
N.Y.); Glover et al. (1985, DNA Cloning: A Practical Approach, Volumes I and II); Gait et
al. (1985, Oligonucleotide Synthesis); Hames and Higgins (1985, Nucleic Acid
Hybridization); Hames and Higgins (1984, Transcription And Translation); Freshney et al.
(1986, Animal Cell Culture); Perbal (1986, Immobilized Cells And Enzymes, IRL Press);
Perbal (1984, A Practical Guide To Molecular Cloning); Ausubel et al. (2002, Current
Protocols in Molecular Biology, John Wiley & Sons, Inc.).

B. Fungi and yeast

Peptides produced in yeast are glycosylated and the glycan structures present thereon
are primarily high mannose structures. In the case of N-glycans, the glycan structures
produced in yeast may contain as many as nine or more mannose residues which may or may
not contain additional sugars added thereto. An example of the type of glycan on peptides
produced by yeast cells is shown in Figure 5, left side. Irrespective of the number of
mannose residues and the type and complexity of additional sugars added thereto, N-glycans
as components of peptides produced in yeast cells comprise a trimannosyl core structure as
shown in Figure 5. When the glycan structure on a peptide produced by a yeast cell is a high
mannose structure, it is a simple matter for the ordinary skilled artisan to remove, in vitro
using available mannosidase enzymes, all of the mannose residues from the molecule except
for those that comprise the trimannosyl core of the glycan, thereby generating a peptide
having an elemental trimannosyl core structure attached thereto. Now, using the techniques
available in the art and armed with the present disclosure, it is a simple matter to
enzymatically add, in vitro, additional sugar moieties to the elemental trimannosyl core
structure to generate a peptide having a desired glycan structure attached thereto. Similarly,
when the peptide produced by the yeast cell comprises a high mannose structure in addition
to other complex sugars attached thereto, it is a simple matter to enzymatically cleave off all
of the additional sugars, including extra mannose residues, to arrive at the elemental
trimannosyl core structure. Once the elemental trimannosyl core structure is produced,
generation of a peptide having desired glycosylation is possible following the directions
provided herein.
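
By way of illustration only, the trimming of a high mannose glycan down to the elemental trimannosyl core can be expressed as a simple composition calculation in Python. The sketch assumes, for the purposes of the example, that the core is represented as three mannose and two GlcNAc residues (Man3GlcNAc2); the function name and the dictionary representation of a glycan are hypothetical.

```python
from collections import Counter

# Assumed composition of the elemental trimannosyl core (Man3GlcNAc2).
TRIMANNOSYL_CORE = Counter({"Man": 3, "GlcNAc": 2})


def residues_to_remove(glycan: dict) -> dict:
    """Return the residues that must be cleaved to leave only the core.

    `glycan` maps a residue name to its count, e.g. {"Man": 9, "GlcNAc": 2}.
    """
    composition = Counter(glycan)

    # A glycan that lacks part of the core cannot be trimmed to the core.
    missing = TRIMANNOSYL_CORE - composition
    if missing:
        raise ValueError(f"glycan lacks core residues: {dict(missing)}")

    # Counter subtraction keeps only the positive excess over the core.
    return dict(composition - TRIMANNOSYL_CORE)
```

For a nine-mannose structure, the sketch reports six mannose residues to be removed by mannosidase treatment; any residue type not present in the core is reported in full.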

By "yeast" is intended ascosporogenous yeasts (Endomycetales), basidiosporogenous
yeasts, and yeast belonging to the Fungi Imperfecti (Blastomycetes). The ascosporogenous
yeasts are divided into two families, Spermophthoraceae and Saccharomycetaceae. The latter
is comprised of four subfamilies, Schizosaccharomycoideae (e.g., genus
Schizosaccharomyces), Nadsonioideae, Lipomycoideae, and Saccharomycoideae (e.g., genera
Pichia, Kluyveromyces, and Saccharomyces). The basidiosporogenous yeasts include the
genera Leucosporidium, Rhodosporidium, Sporidiobolus, Filobasidium, and Filobasidiella.
Yeast belonging to the Fungi Imperfecti are divided into two families, Sporobolomycetaceae
(e.g., genera Sporobolomyces, Bullera) and Cryptococcaceae (e.g., genus Candida). Of
particular interest to the present invention are species within the genera Saccharomyces,
Pichia, Aspergillus, Trichoderma, Kluyveromyces, especially K. lactis and K. drosophilum,
Candida, Hansenula, Schizosaccharomyces, Yarrowia, and Chrysosporium. Since the
classification of yeast may change in the future, for the purposes of this invention, yeast shall
be defined as described in Skinner et al., eds. (1980, Biology and Activities of Yeast, Soc.
App. Bacteriol. Symp. Series No. 9).

In addition to the foregoing, those of ordinary skill in the art are presumably familiar
with the biology of yeast and the manipulation of yeast genetics. See, for example, Bacila et
al., eds. (1978, Biochemistry and Genetics of Yeast, Academic Press, New York); and Rose
and Harrison (1987, The Yeasts (2nd ed.), Academic Press, London). Methods of introducing
exogenous DNA into yeast hosts are well known in the art. There are a wide variety of
methods for transformation of yeast. Spheroplast transformation is taught by Hinnen et al.
(1978, Proc. Natl. Acad. Sci. USA 75:1919-1933); Beggs (1978, Nature 275(5676):104-
109); and Stinchcomb et al. (EPO Publication No. 45,573; herein incorporated by reference).
Electroporation is taught by Becker and Guarente (1991, Methods Enzymol. 194:182-187).
Lithium acetate is taught by Gietz et al. (2002, Methods Enzymol. 350:87-96) and Mount et
al. (1996, Methods Mol Biol. 53:139-145). For a review of transformation systems of non-
Saccharomyces yeasts, see Wang et al. (Crit Rev Biotechnol. 2001;21(3):177-218). For
general procedures on yeast genetic engineering, see Barr et al. (1989, Yeast genetic
engineering, Butterworths, Boston).

In addition to wild-type yeast and fungal cells, there are also strains of yeast and fungi
that have been mutated and/or selected to enhance the level of expression of the exogenous
gene, and the purity, the post-translational processing of the resulting peptide, and the
recovery and purity of the mature peptide. Expression of an exogenous peptide may also be
directed to the cell secretory pathway, as illustrated by the expression of insulin (see Kjeldsen,
2000, Appl. Microbiol. Biotechnol. 54:277-286, and references cited therein). In general, to
cause the exogenous peptide to be secreted from the yeast cell, secretion signals derived from
yeast genes may be used, such as those of the genes of the killer toxin (Stark and Boyd, 1986,
EMBO J. 5:1995-2002) or of the alpha pheromone (Kurjan and Herskowitz, 1982, Cell 30:933;
Brake et al., 1988, Yeast 4:S436).

Regarding the filamentous fungi in general, methods for genetic manipulation can be
found in Kinghorn and Turner (1992, Applied Molecular Genetics of Filamentous Fungi,
Blackie Academic and Professional, New York). Guidance on appropriate vectors can be
found in Martinelli and Kinghorn (1994, Aspergillus: 50 years, Elsevier, Amsterdam).

1. Saccharomyces

In Saccharomyces, suitable yeast vectors for use in producing a peptide include YRp7
(Struhl et al., Proc. Natl. Acad. Sci. USA 76:1035-1039, 1978), YEp13 (Broach et al., Gene
8:121-133, 1979), POT vectors (Kawasaki et al., U.S. Pat. No. 4,931,373, which is
incorporated by reference herein), pJDB249 and pJDB219 (Beggs, Nature 275:104-108,
1978) and derivatives thereof. Preferred promoters for use in yeast include promoters for
yeast glycolytic gene expression (Hitzeman et al., J. Biol. Chem. 255:12073-12080, 1980;
Alber and Kawasaki, J. Mol. Appl. Genet. 1:419-434, 1982; Kawasaki, U.S. Pat. No.
4,599,311) or alcohol dehydrogenase genes (Young et al., in Genetic Engineering of
Microorganisms for Chemicals, Hollaender et al. (eds.), p. 355, Plenum, New York, 1982;
Ammerer, Meth. Enzymol. 101:192-201, 1983), and the ADH2-4c promoter (Russell et al.,
Nature 304:652-654, 1983; Irani and Kilgore, U.S. patent application Ser. No. 07/784,653,
CA 1,304,020 and EP 284 044, which are incorporated herein by reference). The expression
units may also include a transcriptional terminator. A preferred transcriptional terminator is
the TPI1 terminator (Alber and Kawasaki, ibid).

Examples of such yeast-bacteria shuttle vectors include YEp24 (Botstein et al. (1979)
Gene 8:17-24), pC1 (Brake et al. (1984) Proc. Natl. Acad. Sci. USA 81:4642-4646), and
YRp17 (Stinchcomb et al. (1982) J. Mol. Biol. 158:157). Additionally, a plasmid expression
vector may be a high or low copy number plasmid, the copy number generally ranging from
about 1 to about 200. In the case of high copy number yeast vectors, there will generally be
at least 10, preferably at least 20, and usually not exceeding about 150 copies of the vector in
a single host. Depending upon the heterologous peptide selected, either a high or low copy
number vector may be desirable, depending upon the effect of the vector and the recombinant
peptide on the host. See, for example, Brake et al. (1984) Proc. Natl. Acad. Sci. USA
81:4642-4646. DNA constructs of the present invention can also be integrated into the yeast
genome by an integrating vector. Examples of such vectors are known in the art. See, for
example, Botstein et al. (1979) Gene 8:17-24.

The selection of suitable yeast and other microorganism hosts for the practice of the
present invention is within the skill of the art. Of particular interest are the Saccharomyces
species S. cerevisiae, S. carlsbergensis, S. diastaticus, S. douglasii, S. kluyveri, S. norbensis,
and S. oviformis. When selecting yeast host cells for expression of a desired peptide, suitable
host cells may include those shown to have, inter alia, good secretion capacity, low
proteolytic activity, and overall vigor. Yeast and other microorganisms are generally
available from a variety of sources, including the Yeast Genetic Stock Center, Department of
Biophysics and Medical Physics, University of California, Berkeley, Calif.; and the American
Type Culture Collection, Manassas VA. For a review, see Strathern et al., eds. (1981, The
Molecular Biology of the Yeast Saccharomyces, Cold Spring Harbor Laboratory, Cold
Spring Harbor, N.Y.).

Methods of introducing exogenous DNA into yeast hosts are well known in the art.

2. Pichia

The use of Pichia methanolica as a host cell for the production of recombinant
peptides is disclosed in PCT Applications WO 97/17450, WO 97/17451, WO 98/02536, and
WO 98/02565. DNA molecules for use in transforming P. methanolica are commonly
prepared as double-stranded, circular plasmids, which are preferably linearized prior to
transformation. For peptide production in P. methanolica, it is preferred that the promoter
and terminator in the plasmid be that of a P. methanolica gene, such as a P. methanolica
alcohol utilization gene (AUG1 or AUG2). Other useful promoters include those of the
dihydroxyacetone synthase (DHAS), formate dehydrogenase (FMD), and catalase (CAT)
genes, as well as those disclosed in U.S. Patent No. 5,252,726. To facilitate integration of the
DNA into the host chromosome, it is preferred to have the entire expression segment of the
plasmid flanked at both ends by host DNA sequences. A preferred selectable marker for use
in Pichia methanolica is a P. methanolica ADE2 gene, which encodes phosphoribosyl-5-
aminoimidazole carboxylase (AIRC; EC 4.1.1.21), which allows ade2 host cells to grow in
the absence of adenine. For large-scale, industrial processes where it is desirable to minimize
the use of methanol, host cells in which both methanol utilization genes (AUG1 and AUG2)
are deleted are preferred. For production of secreted peptides, host cells deficient in vacuolar
protease genes (PEP4 and PRB1) are preferred. Electroporation is used to facilitate the
introduction of a plasmid containing DNA encoding a peptide of interest into P. methanolica
cells. It is preferred to transform P. methanolica cells by electroporation using an
exponentially decaying, pulsed electric field having a field strength of from 2.5 to 4.5 kV/cm,
preferably about 3.75 kV/cm, and a time constant (τ) of from 1 to 40 milliseconds, most
preferably about 20 milliseconds. For a review of the use of Pichia pastoris for large-scale
production of antibody fragments, see Fischer et al. (1999, Biotechnol Appl Biochem. 30 (Pt
2):117-120).
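
By way of illustration only, the electroporation parameters recited above can be related to instrument settings with a short Python sketch. The sketch assumes a capacitor-discharge instrument, for which the time constant of an exponentially decaying pulse is the product of resistance and capacitance; the cuvette gap, resistance, and capacitance values used in the usage note are hypothetical and are not taken from the disclosure.

```python
# Preferred ranges recited in the text for P. methanolica electroporation.
FIELD_RANGE_KV_PER_CM = (2.5, 4.5)
TIME_CONSTANT_RANGE_MS = (1.0, 40.0)


def pulse_voltage_kv(field_kv_per_cm: float, gap_cm: float) -> float:
    """Voltage to set for a desired field strength across a cuvette gap."""
    return field_kv_per_cm * gap_cm


def time_constant_ms(resistance_ohm: float, capacitance_uf: float) -> float:
    """Time constant tau = R * C of an exponentially decaying pulse.

    Ohms times microfarads gives microseconds, so divide by 1000 for milliseconds.
    """
    return resistance_ohm * capacitance_uf / 1000.0


def within_preferred_ranges(field_kv_per_cm: float, tau_ms: float) -> bool:
    """Check a field strength and time constant against the recited ranges."""
    low_field, high_field = FIELD_RANGE_KV_PER_CM
    low_tau, high_tau = TIME_CONSTANT_RANGE_MS
    return low_field <= field_kv_per_cm <= high_field and low_tau <= tau_ms <= high_tau
```

For instance, with a hypothetical 0.2 cm cuvette, the preferred field strength of about 3.75 kV/cm corresponds to a set voltage of about 0.75 kV, and a hypothetical 800 ohm resistance with a 25 microfarad capacitor gives the preferred time constant of about 20 milliseconds.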

3. Aspergillus

Methods to express peptides in Aspergillus spp. are well known in the art, including
but not limited to those described in Carrez et al., 1990, Gene 94:147-154; Contreras, 1991,
Bio/Technology 9:378-381; Yelton et al., 1984, Proc. Natl. Acad. Sci. USA 81:1470-1474;
Tilburn et al., 1983, Gene 26:205-221; Kelly and Hynes, 1985, EMBO J. 4:475-479;
Ballance et al., 1983, Biochem. Biophys. Res. Comm. 112:284-289; Buxton et al., 1985,
Gene 37:207-214, and U.S. Pat. No. 4,935,349, incorporated by reference herein in its
entirety. Examples of promoters useful in Aspergillus are found in U.S. Patent No.
5,252,726. Strains of Aspergillus useful for peptide expression are found in U.S. Patent No.
4,935,349. Commercial production of exogenous peptides is available from Novozymes
for Aspergillus niger and Aspergillus oryzae.

4. Trichoderma

Trichoderma has certain advantages over other species of recombinant host cells for
expression of desired peptides. This organism is easy to grow in large quantities and it has
the ability to glycosylate and efficiently secrete high yields of recombinant mammalian
peptides into the medium, making isolation of the peptide relatively easy. In addition, the
glycosylation pattern on expressed peptides is more similar to that on human peptides than
peptides expressed in other systems. However, there are still differences in the glycan
structures on expressed peptides from these cells. For example, terminal sialic acid residues
are important to the therapeutic function of a peptide in a mammalian system, since the
presence of these moieties at the end of the glycan structure impedes peptide clearance from
the mammalian bloodstream. The mechanism behind the increased biologic half-life of
sialylated molecules is believed to lie in their decreased recognition by lectins (Drickamer,
1988, J. Biol. Chem. 263:9557-9560). However, in general fungal cells do not add terminal
sialic acid residues to glycans on peptides, and peptides synthesized in fungal cells are
therefore asialic. According to the present invention, this deficiency can be remedied using
the in vitro glycan remodeling methods of the invention described in detail elsewhere herein.

Trichoderma species useful as hosts for the production of peptides to be remodeled
include T. reesei, such as QM6a, ALKO2442 or CBS383.78 (Centraalbureau voor
Schimmelcultures, Oosterstraat 1, PO Box 273, 3740 AG Baarn, The Netherlands), or
ATCC13631 (American Type Culture Collection, Manassas VA 10852, USA, type); T.
viride (such as CBS189.79 (det. W. Gams)); T. longibrachiatum, such as CBS816.68 (type);
T. pseudokoningii (such as MUCL19358; Mycothèque de l'Université Catholique de
Louvain); T. saturnisporum CBS330.70 (type); T. harzianum CBS316.31 (det. W. Gams); T.
virgatum (T. pseudokoningii) ATCC24961. Most preferably, the host is T. reesei and more
preferably, it is T. reesei strains QM9414 (ATCC 26921), RUT-C-30 (ATCC 56765), and
highly productive mutants such as VTT-D-79125, which is derived from QM9414
(Nevalainen, Technical Research Centre of Finland Publications 26, (1985), Espoo, Finland).

The transformation of Trichoderma with DNA is performed using any technique
known in the art, including that taught in European patent No. EP0244234, Harkki (1989,
Bio/Technology 7:596-601) and Uusitalo (1991, J. Biotech. 17:35-50). Culture of
Trichoderma is supported by previous extensive experience in industrial scale fermentation
techniques; for example, see Finkelstein, 1992, Biotechnology of Filamentous Fungi:
Technology and Products, Butterworth-Heinemann, publishers, Stoneham, Mass.

5. Kluyveromyces

Yeast belonging to the genus Kluyveromyces have been used as host organisms for the
production of recombinant peptides. Peptides produced by this genus of yeast are, in
particular, chymosin (European Patent 96 430), thaumatin (European Patent 96 910),
albumin, interleukin-1β, TPA, TIMP (European Patent 361 991) and albumin derivatives
having a therapeutic function (European Patent 413 622). Species of particular interest in the
genus Kluyveromyces include K. lactis.

Methods of expressing recombinant peptides in Kluyveromyces spp. are well known in
the art. Vectors for the expression and secretion of human recombinant peptides in
Kluyveromyces are known in the art (Yeh, J. Cell. Biochem. Suppl. 14C:68, Abst. H402; Fleer,
1990, Yeast 6 (Special Issue):S449) as are procedures for transformation and expression of
recombinant peptides (Ito et al., 1983, J. Bacteriol. 153:163-168; van den Berg, 1990,
Bio/Technology 8:135-139; U.S. Patent No. 5,633,146, WO8304050A1, EP0096910,
EP0241435, EP0301670, EP0361991, all of which are incorporated by reference herein in
their entirety). For a review of genetic manipulation of Kluyveromyces lactis linear DNA
plasmids by gene targeting and plasmid shuffles, see Schaffrath et al. (1999, FEMS Microbiol
Lett. 178(2):201-210).



6. Chrysosporium

The fungal genus Chrysosporium has recently been used for expression of foreign
recombinant peptides. A description of the procedures by which one of skill in the art can
use Chrysosporium to express foreign peptides is found in WO 00/20555
(incorporated by reference herein in its entirety). Species particularly suitable for an expression
system include, but are not limited to, C. botryoides, C. carmichaelii, C. crassitunicatum, C.
europae, C. evolceannui, C. fastidium, C. filiforme, C. georgiae, C. globiferum, C. globiferum
var. articulatum, C. globiferum var. niveum, C. hirundo, C. hispanicum, C. holmii, C.
indicum, C. inops, C. keratinophilum, C. kreiselii, C. kuzurovianum, C. lignorum, C.
lobatum, C. lucknowense, C. lucknowense Garg 27K, C. medium, C. medium var. spissescens,
C. mephiticum, C. merdarium, C. merdarium var. roseum, C. minor, C. pannicola, C.
parvum, C. parvum var. crescens, C. pilosum, C. pseudomerdarium, C. pyriformis, C.
queenslandicum, C. sigleri, C. sulfureum, C. synchronum, C. tropicum, C. undulatum, C.
vallenarense, C. vespertilium, and C. zonatum.

7. Others 

Methods for transforming Schwanniomyces are disclosed in European Patent 394
538. Methods for transforming Acremonium chrysogenum are disclosed by U.S. Pat. No.
5,162,228. Methods for transforming Neurospora are disclosed by U.S. Pat. No. 4,486,533.
Also known is an expression system specifically for Schizosaccharomyces pombe (European
Patent 385 391). General methods for expressing peptides in fission yeast,
Schizosaccharomyces pombe, can be found in Giga-Hama and Kumagai (1997, Foreign gene
expression in fission yeast: Schizosaccharomyces pombe, Springer, Berlin).

C. Mammalian systems

As discussed above, mammalian cells typically produce a heterogeneous mixture of
N-glycan structures which vary with respect to the number and arrangement of additional
sugars attached to the trimannosyl core. Typically, mammalian cells produce peptides having
a complex glycan structure, such as that shown in Figure 4, right side. Using the methods of
the present invention, a peptide produced in a mammalian cell may be remodeled in vitro to
generate a peptide having desired glycosylation by first identifying the primary glycan
structure and then determining which sugars must be removed in order to remodel the glycan
structure. As discussed herein, the sugars to be removed will determine which cleavage
enzymes will be used and thus, the precise steps of the remodeling process will vary
depending on the primary glycan structure used as the initial substrate. A sample scheme for
remodeling a glycan structure commonly produced in mammalian cells is shown in Figure 3.
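
By way of illustration only, the dependence of the cleavage steps on the sugars to be removed can be sketched in Python as a lookup from each sugar to a general class of exoglycosidase. The mapping, the abbreviations, and the function name are assumptions made for the example; the sketch does not reproduce the scheme of Figure 3 and does not address linkage specificity.

```python
# Assumed mapping from a sugar residue to the general class of enzyme that removes it.
CLEAVAGE_ENZYME = {
    "Sia": "sialidase",
    "Gal": "galactosidase",
    "GlcNAc": "N-acetylglucosaminidase",
    "Fuc": "fucosidase",
    "Man": "mannosidase",
}


def cleavage_plan(sugars_outer_to_inner: list) -> list:
    """List the enzyme classes needed, in order, to strip a glycan branch.

    The input names the residues to be removed, outermost residue first.
    Consecutive residues of the same type are handled by a single step.
    """
    plan = []
    for sugar in sugars_outer_to_inner:
        if sugar not in CLEAVAGE_ENZYME:
            raise ValueError(f"no cleavage enzyme listed for {sugar!r}")
        enzyme = CLEAVAGE_ENZYME[sugar]
        # Skip a repeat of the step just planned.
        if not plan or plan[-1] != enzyme:
            plan.append(enzyme)
    return plan
```

A branch that lacks terminal sialic acid simply yields a shorter plan, which is one way to see why peptides from glycosylation-defective cells may require less enzymatic manipulation.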

The N-glycan biosynthetic pathway in mammalian cells has been well characterized
(reviewed in Moremen, 1994, Glycobiology 4:113-125). Many of the enzymes necessary for
glycan synthesis have been identified, and mutant cell lines defective in this enzymatic
pathway have been isolated including the Chinese hamster ovary (CHO) cell lines Lec23
(defective in alpha-glucosidase I) and Lec18 (novel GlcNAc-TVIII). The glycosylation
pattern of peptides produced by these mutant cells is altered relative to normal CHO cells.
As discussed herein, the glycosylation defects in these and other mutant cells can be
exploited for the purposes of producing a peptide that lacks a complex glycan structure. For
example, peptides produced by Lec23 cells lack sialic acid residues, and thus require less
enzymatic manipulation in order to reduce the glycan structure to an elemental trimannosyl
core or to Man3GlcNAc4. Thus, peptides produced in these cells can serve as preferred
substrates for glycan remodeling. One of ordinary skill in the art could isolate or identify
other glycosylation-defective cell lines based on known methods, for example the method
described in Stanley et al., 1990, Somatic Cell Mol. Genet., 16:211-223. Use of
glycosylation-defective cell lines, those identified and as yet unidentified, is included in the
invention for the purpose of generating preferred peptide substrates for the remodeling
processes described herein.

Expression vectors useful for expressing exogenous peptides in mammalian cells are
numerous, and are well known to those in the art. Many mammalian expression vectors are
now commercially available from companies, including Novagen, Inc (Madison, WI), Gene
Therapy Systems (San Diego, CA), Promega (Madison, WI), ClonTech Inc. (Palo Alto, CA),
and Stratagene (La Jolla, CA), among others.

There are several mammalian cell lines that are particularly adept at expressing
exogenous peptides. Typically mammalian cell lines originate from tumor cells extracted
from mammals that have become immortalized, that is to say, they can replicate in culture
essentially indefinitely. These cell lines include, but are not limited to, CHO (Chinese
hamster ovary, e.g. CHO-K1; ATCC No. CCL 61) and variants thereof, NS0 (mouse
myeloma), BNK, BHK 570 (ATCC No. CRL 10314), BHKl (ATCC No. CRL 1632),
Per.C6™ (immortalized human cells, Crucell N.V., Leiden, The Netherlands), COS-1 (ATCC
No. CRL 1650), COS-7 (ATCC No. CRL 1651), HEK 293, mouse L cells, T lymphoid cell
lines, BW5147 cells and MDCK (Madin-Darby canine kidney), HeLa (human), A549 (human
lung carcinoma), 293 (ATCC No. CRL 1573; Graham et al., 1977, Gen. Virol. 36:59-72),
BGMK (Buffalo Green Monkey kidney), Hep-2 (human epidermoid larynx carcinoma), LLC-
MK2 (African Green Monkey Kidney), McCoy, NCI-H292 (human pulmonary
mucoepidermoid carcinoma tube), RD (rhabdomyosarcoma), Vero (African Green Monkey
kidney), HEL (human embryonic lung), Human Fetal Lung-Chang, MRC5 (human
embryonic lung), MRHF (human foreskin), and WI-38 (human embryonic lung). In some
cases, the cells in which the therapeutic peptide is expressed may be cells derived from the
patient to be treated, or they may be derived from another related or unrelated mammal. For
example, fibroblast cells may be isolated from the mammal's skin tissue, and cultured and
transformed in vitro. This technology is commercially available from Transkaryotic
Therapies, Inc. (Cambridge, MA). Almost all currently used cell lines are available from the
American Type Culture Collection (ATCC, Manassas, VA) and BioWhittaker (Walkersville,
Maryland).

Mammalian cells may be transformed with DNA using any one of several techniques that are
well known to those in the art. Such techniques include, but are not limited to, calcium
phosphate transformation (Chen and Okayama, 1988; Graham and van der Eb, 1973;
Corsaro and Pearson, 1981, Somatic Cell Genetics 7:603), diethylaminoethyl (DEAE)-
dextran transfection (Fujita et al., 1986; Lopata et al., 1984; Selden et al., 1986),
electroporation (Neumann et al., 1982; Potter, 1988; Potter et al., 1984; Wong and
Neumann, 1982), cationic lipid reagent transfection (Elroy-Stein and Moss, 1990; Felgner et
al., 1987; Rose et al., 1991; Whitt et al., 1990; Hawley-Nelson et al., 1993, Focus 15:73;
Ciccarone et al., 1993, Focus 15:80), retroviral (Cepko et al., 1984; Miller and Baltimore,
1986; Pear et al., 1993; Austin and Cepko, 1990; Bodine et al., 1991; Fekete and Cepko,
1993; Lemischka et al., 1986; Turner et al., 1990; Williams et al., 1984; Miller and Rosman,
1989, BioTechniques 7:980-90; Wang and Finer, 1996, Nature Med. 2:714-6), polybrene
(Chaney et al., 1986; Kawai and Nishizawa, 1984), microinjection (Capecchi, 1980), and
protoplast fusion (Rassoulzadegan et al., 1982; Sandri-Goldin et al., 1981; Schaffer, 1980),
among others. In general, see Sambrook et al. (2001, Molecular Cloning: A Laboratory
Manual, Cold Spring Harbor Laboratory, New York) and Ausubel et al. (2002, Current
Protocols in Molecular Biology, John Wiley & Sons, New York) for transformation
techniques.
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Recently the baculovirus system, popular for transfbimation of insect cells, has beeaa. 
adapted for stable transformation of mammalian cells (see, for review, Koat and Condreay, 
2002, Trends Biotechnol. 20:173-180, and references cited therein). The production of 
recombinant peptides in cultured mammalian cells is disclosed, for example, in U.S. Pat. Nos. 

5 4,713,339, 4,784,950; 4,579,821; and 4,656,134. Several companies offer the services of 
transformation and culture of mammalian cells, including Cell Trends, Inc. (Middletown, 
MD). Techniques for culturing mammalian cells are well known in the art, and further found 
in Hauser et al. (1997, Mammalian Cell Biotechnology. Walter de Gruyer, Inc., Hawthorne, 
NY), and Sambrook et al. (2001, Molecular Cloning: A Laboratory Manual. Cold Sprmg 

1 0 Harbor and references cited therein. 
D. Insect

Insect cells, and in particular cultured insect cells, express peptides having N-linked
glycan structures that are rarely sialylated and usually comprise mannose residues which may
or may not have additional fucose residues attached thereto. Examples of the types of glycan
structures present on peptides produced in cultured insect cells are shown in Figure 7, and
mannose glycans thereof.

Baculovirus-mediated expression in insect cells has become particularly well-established
for the production of recombinant peptides (Altmann et al., 1999, Glycoconjugate
J. 16:109-123). With regard to peptide folding and post-translational processing, insect cells
are second only to mammalian cell lines. However, as noted above, N-glycosylation of
peptides in insect cells differs in many respects from N-glycosylation in mammalian cells,
particularly in that insect cells frequently generate truncated glycan structures comprising
oligosaccharides containing just three or sometimes only two mannose residues. These
structures may be additionally substituted with fucose residues.

According to the present invention, a peptide produced in an insect cell may be
remodeled in vitro to generate a peptide with desired glycosylation by first optionally
removing any substituted fucose residues using an appropriate fucosidase enzyme. In
instances where the peptide comprises an elemental trimannosyl core structure following the
removal of fucose residues, then all that is required is the in vitro addition of the appropriate
sugars to the trimannosyl core structure to generate a peptide having desired glycosylation.
In instances when the peptide might contain only two mannose residues in the glycan
structure following removal of any fucose residues, a third mannose residue may be added
using a mannosyltransferase enzyme and a suitable donor molecule such as GDP-mannose,
and thereafter the appropriate residues are added to generate a peptide having desired
glycosylation.
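
The remodeling sequence just described for insect-derived peptides can be summarized as a
short decision procedure. The following is only an illustrative sketch, assuming a glycan can
be summarized by its mannose and fucose counts; the names Glycan and remodeling_steps and
the wording of each step are hypothetical labels introduced here, not terms of the disclosure.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Glycan:
    """Hypothetical summary of an insect-derived N-linked glycan."""
    mannose: int  # number of mannose residues (two or three in the text)
    fucose: int   # number of fucose substituents (zero if none)


def remodeling_steps(glycan: Glycan) -> List[str]:
    """List the in vitro steps suggested by the text for a given glycan."""
    steps = []
    # Fucose removal is optional in the text; it is listed whenever fucose is present.
    if glycan.fucose > 0:
        steps.append("remove fucose with a fucosidase")
    if glycan.mannose == 2:
        # Only two mannose residues: rebuild the trimannosyl core first.
        steps.append("add third mannose with a mannosyltransferase and GDP-mannose")
    elif glycan.mannose != 3:
        raise ValueError("this sketch only covers glycans with two or three mannose residues")
    steps.append("add the appropriate sugars to the trimannosyl core")
    return steps


if __name__ == "__main__":
    for step in remodeling_steps(Glycan(mannose=2, fucose=1)):
        print(step)
```

A glycan that already carries three mannose residues and no fucose yields only the final
sugar-addition step.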

Protocols for the use of baculovirus to transform insect cells are well known to those
in the art. Several books have been published which provide the procedures to use the
baculovirus system to express peptides in insect cells. These books include, but are not
limited to, Richardson (Baculovirus Expression Protocols, 1998, Methods in Molecular
Biology, Vol 39, Humana Pr), O'Reilly et al. (1994, Baculovirus Expression Vectors: A
Laboratory Manual, Oxford Univ Press), and King and Possee (1992, The Baculovirus
Expression System: A Laboratory Guide, Chapman & Hall). In addition, there are also
publications such as Luckow (1993, Curr. Opin. Biotechnol. 4:564-572) and Miller (1993,
Curr. Opin. Genet. Dev. 3:97-101).

Many patents have also been issued that relate to systems for baculoviral expression
of foreign proteins. These patents include, but are not limited to, U.S. Patent No. 6,210,966
(Culture medium for insect cells lacking glutamine and contains ammonium salt), U.S. Patent
No. 6,090,584 (Use of BVACs (BaculoVirus Artificial Chromosomes) to produce
recombinant peptides), U.S. Patent No. 5,871,986 (Use of a baculovirus to express a
recombinant nucleic acid in a mammalian cell), U.S. Patent No. 5,759,809 (Methods of
expressing peptides in insect cells and methods of killing insects), U.S. Patent No. 5,753,220
(Cysteine protease gene defective baculovirus, process for its production, and process for the
production of economic peptide by using the same), U.S. Patent No. 5,750,383 (Baculovirus
cloning system), U.S. Patent No. 5,731,182 (Non-mammalian DNA virus to express a
recombinant nucleic acid in a mammalian cell), U.S. Patent No. 5,728,580 (Methods and
culture media for inducing single cell suspension in insect cell lines), U.S. Patent No.
5,583,023 (Modified baculovirus, its preparation process and its application as a gene
expression vector), U.S. Patent No. 5,571,709 (Modified baculovirus and baculovirus
expression vectors), U.S. Patent No. 5,521,299 (Oligonucleotides for detection of baculovirus
infection), U.S. Patent No. 5,516,657 (Baculovirus vectors for expression of secretory and
membrane-bound peptides), U.S. Patent No. 5,475,090 (Gene encoding a peptide which
enhances virus infection of host insects), U.S. Patent No. 5,472,858 (Production of
recombinant peptides in insect larvae), U.S. Patent No. 5,348,886 (Method of producing
recombinant eukaryotic viruses in bacteria), U.S. Patent No. 5,322,774 (Prokaryotic leader
sequence in recombinant baculovirus expression system), U.S. Patent No. 5,278,050 (Method
to improve the efficiency of processing and secretion of recombinant genes in insect
systems), U.S. Patent No. 5,244,805 (Baculovirus expression vectors), U.S. Patent No.
5,229,293 (Recombinant baculovirus), U.S. Patent No. 5,194,376 (Baculovirus expression
system capable of producing recombinant peptides at high levels), U.S. Patent No. 5,179,007
(Method and vector for the purification of recombinant peptides), U.S. Patent No. 5,169,784
(Baculovirus dual promoter expression vector), U.S. Patent No. 5,162,222 (Use of
baculovirus early promoters for expression of recombinant nucleic acids in stably
transformed insect cells or recombinant baculoviruses), U.S. Patent No. 5,155,037 (Insect
signal sequences useful to improve the efficiency of processing and secretion of recombinant
nucleic acids in insect systems), U.S. Patent No. 5,147,788 (Baculovirus vectors and methods
of use), U.S. Patent No. 5,110,729 (Method of producing peptides using baculovirus vectors
in cultured cells), U.S. Patent No. 5,077,214 (Use of baculovirus early promoters for
expression of recombinant genes in stably transformed insect cells), U.S. Patent No.
5,023,328 (Lepidopteran AKH signal sequence), and U.S. Patent Nos. 4,879,236 and
4,745,051 (Method for producing a recombinant baculovirus expression vector). All of the
aforementioned patents are incorporated in their entirety by reference herein.

Insect cell lines of several different species origin are currently being used for peptide
expression, and these lines are well known to those in the art. Insect cell lines of interest
include, but are not limited to, dipteran and lepidopteran insect cells in general, Sf9 and
variants thereof (fall armyworm Spodoptera frugiperda), Estigmene acrea, Trichoplusia ni,
Bombyx mori, Malacosoma disstria, Drosophila lines Kc1 and SL2 among others, and
mosquito.



E. Plants 

Plant cells as peptide producers present a different set of issues. While N-linked
glycans produced in plants comprise a trimannosyl core structure, this pentasaccharide
backbone may comprise several different additional sugars as shown in Figure 6. For
example, in one instance, the trimannosyl core structure is substituted by a β1,2-linked xylose
residue and an α1,3-linked fucose residue. In addition, plant cells may also produce a
Man5GlcNAc2 structure. Peptides produced in plant cells are often highly antigenic as a
result of the presence of the core α1,3 fucose and xylose on the glycan structure, and are
rapidly cleared from the blood stream when introduced into a mammal due to the absence of
terminal sialic acid residues. Therefore, unless these peptides are remodeled using the
methods provided herein, they are generally considered to be unsuitable as therapeutic agents
in mammals. While some monoclonal antibodies expressed in plant cells were found to be
non-immunogenic in mouse, it is likely that the glycan chains were not immunogenic because
they were buried in the Fc region in these antibodies (Chargelegue et al., 2000, Transgenic
Res. 9(3):187-194).

Following the directions provided herein, it is now possible to generate a peptide
produced in a plant cell wherein an increased number of the glycan structures present thereon
comprise an elemental trimannosyl core structure, or a Man3GlcNAc4 structure. This is
accomplished by cleaving off any additional sugars in vitro using a combination of
appropriate glycosidases, including fucosidases, until the elemental trimannosyl core
structure or the Man3GlcNAc4 structure is arrived at. These cleavage reactions should also
include removal of any fucose or xylose residues from the structures in order to diminish the
antigenicity of the final peptide when introduced into a mammal. Plant cells having
mutations that inhibit the addition of fucose and xylose residues to the trimannosyl core
structure are known in the art (von Schaewen et al., 1993, Plant Physiology 102:1109-1118).
The use of these cells to produce peptides having glycans which lack fucose and xylose is
contemplated by the invention. Upon production of the elemental trimannosyl core or
Man3GlcNAc4 structure, additional sugars may then be added thereto to arrive at a peptide
having desired glycosylation that is therefore suitable for therapeutic use in a mammal.

Transgenic plants are considered by many to be the expression system of choice for
pharmaceutical peptides. Potentially, plants can provide a cheaper source of recombinant
peptides. It has been estimated that the production costs of recombinant peptides in plants
could be 10 to 50 times lower than that of producing the same peptide in E. coli.
While there are slight differences in the codon usage in plants as compared to animals, these
can be compensated for by adjusting the recombinant DNA sequences (see, Kusnadi et al.,
1997, Biotechnol. Bioeng. 56:473-484; Khoudi et al., 1999, Biotechnol. Bioeng. 64:135-143;
Hood et al., 1999, Adv. Exp. Med. Biol. 464:127-147). In addition, peptide synthesis,
secretion and post-translational modification are very similar in plants and animals, with only
minor differences in plant glycosylation (see, Fischer et al., 2000, J. Biol. Regul. Homeost.
Agents 14:83-92). Moreover, products from transgenic plants are less likely to be
contaminated by animal pathogens, microbial toxins and oncogenic sequences.

The expression of recombinant peptides in plant cells is well known in the art. In
addition to transgenic plants, peptides can also be produced in transgenic plant cell cultures (Lee
et al., 1997, Mol. Cell. 7:783-787), and non-transgenic plants inoculated with recombinant
plant viruses. Several books have been published that describe protocols for the genetic
transformation of plant cells: Potrykus (1995, Gene transfer to plants, Springer, New York),
Nickoloff (1995, Plant cell electroporation and electrofusion protocols, Humana Press,
Totowa, NJ) and Draper (1988, Plant genetic transformation, Oxford Press, Boston).

Several methods are currently used to stably transform plant cells with recombinant
genetic material. These methods include, but are not limited to, Agrobacterium
transformation (Bechtold and Pelletier, 1998; Escudero and Hohn, 1997; Hansen and Chilton,
1999; Touraev et al., 1997), biolistics (microprojectiles) (Finer et al., 1999; Hansen and
Chilton, 1999; Shilito, 1999), electroporation of protoplasts (Fromm et al., 1985; Ou-Lee et
al., 1986; Rhodes et al., 1988; Saunders et al., 1989; Trick et al., 1997), polyethylene glycol
treatment (Shilito, 1999; Trick et al., 1997), in planta microinjection (Leduc et al., 1996;
Zhou et al., 1983), seed imbibition (Trick et al., 1997), laser beam (1996), and silicon carbide
whiskers (Thompson et al., 1995; U.S. Patent Appln. No. 20020100077, incorporated by
reference herein in its entirety).

Many kinds of plants are amenable to transformation and expression of exogenous
peptides. Plants of particular interest to express the peptides to be used in the remodeling
method of the invention include, but are not limited to, Arabidopsis thaliana, rapeseed
(Brassica spp.; Ruiz and Blumwald, 2002, Planta 214:965-969), soybean (Glycine max),
sunflower (Helianthus annuus), oil palm (Elaeis guineensis), groundnut (peanut, Arachis
hypogaea; Deng et al., 2001, Cell. Res. 11:156-160), coconut (Cocos nucifera), castor
(Ricinus communis), safflower (Carthamus tinctorius), mustard (Brassica spp. and Sinapis
alba), coriander (Coriandrum sativum), squash (Cucurbita maxima; Spencer and Snow,
2001, Heredity 86(Pt 6):694-702), linseed/flax (Linum usitatissimum; Lamblin et al., 2001,
Physiol Plant 112:223-232), Brazil nut (Bertholletia excelsa), jojoba (Simmondsia chinensis),
maize (Zea mays; Hood et al., 1999, Adv. Exp. Med. Biol. 464:127-147; Hood et al., 1997,
Mol. Breed. 3:291-306; Petolino et al., 2000, Transgenic Research 9:1-9), alfalfa (Khoudi et
al., 1999, Biotechnol. Bioeng. 64:135-143), tobacco (Nicotiana tabacum; Wright et al., 2001,
Transgenic Res. 10:177-181; Frigerio et al., 2000, Plant Physiol. 123:1483-1493; Cramer et
al., 1996, Ann. New York Acad. Sci. 792:62-71; Cabanes-Macheteau et al., 1999,
Glycobiology 9:365-372; Ruggiero et al., 2000, FEBS Lett. 469:132-136), canola (Bai et al.,
2001, Biotechnol. Prog. 17:168-174; Zhang et al., 2000, J. Anim. Sci. 78:2868-2878), potato
(Tacket et al., 1998, J. Infect. Dis. 182:302-305; Richter et al., 2000, Nat. Biotechnol.
18:1167-1171; Chong et al., 2000, Transgenic Res. 9:71-78), alfalfa (Wigdorovitz et al.,
1999, Virology 255:347-353), pea (Pisum sativum; Perrin et al., 2000, Mol. Breed.
6:345-352), rice (Oryza sativa; Stoger et al., 2000, Plant Mol. Biol. 42:583-590), cotton
(Gossypium hirsutum; Kornyeyev et al., 2001, Physiol Plant 113:323-331), barley (Hordeum
vulgare; Petersen et al., 2002, Plant Mol Biol 49:45-58), wheat (Triticum spp.; Pellegrineschi
et al., 2002, Genome 45:421-430) and bean (Vicia spp.; Saalbach et al., 1994, Mol Gen Genet
242:226-236).

If expression of the recombinant nucleic acid is desired in a whole plant rather than in
cultured cells, plant cells are first transformed with DNA encoding the peptide, following
which the plant is regenerated. This involves tissue culture procedures that are typically
optimized for each plant species. Protocols to regenerate plants are already well known in the
art for many species. Furthermore, protocols for other species can be developed by one of
skill in the art using routine experimentation. Numerous laboratory manuals are available
that describe procedures for plant regeneration, including but not limited to, Smith (2000,
Plant tissue culture: techniques and experiments, Academic Press, San Diego), Bhojwani and
Razdan (1996, Plant tissue culture: theory and practice, Elsevier Science Pub., Amsterdam),
Islam (1996, Plant tissue culture, Oxford & IBH Pub. Co., New Delhi, India), Dodds and
Roberts (1995, Experiments in plant tissue culture, New York: Cambridge University Press,
Cambridge England), Bhojwani (Plant tissue culture: applications and limitations, Elsevier,
Amsterdam, 1990), Trigiano and Gray (2000, Plant tissue culture concepts and laboratory
exercises, CRC Press, Boca Raton, Fla), and Lindsey (1991, Plant tissue culture manual:
fundamentals and applications, Kluwer Academic, Boston).

While purifying recombinant peptides from plants may potentially be costly, several
systems have been developed to minimize these costs. One method directs the synthesized
peptide to the seed endosperm, from where it can easily be extracted (Wright et al., 2001,
Transgenic Res. 10:177-181; Guda et al., 2000, Plant Cell Res. 19:257-262; and U.S. Patent
No. 5,767,379, which is incorporated by reference herein in its entirety). An alternative
approach is the co-extraction of the recombinant peptide with conventional plant products
such as starch, meal or oil. In oil-seed rape, a fusion peptide of oleosin-hirudin, when
expressed in the plant, attaches to the oil body of the seed, and can be extracted from the
plant seed along with the oil (Parmenter, 1995, Plant Mol. Biol. 29:1167-1180; U.S. Patent
Nos. 5,650,554, 5,792,922, 5,948,682 and 6,288,304, and US application 2002/0037303, all
of which are incorporated in their entirety by reference herein). In a variation on this
approach, the oleosin is fused to a peptide having affinity for the exogenous co-expressed
peptide of interest (U.S. Patent No. 5,856,452, incorporated by reference herein in its
entirety).

Expression of recombinant peptides in plant plastids, such as the chloroplast,
generates peptides having no glycan structures attached thereto, similar to the situation in
prokaryotes. However, the yield of such peptides is vastly greater when expressed in these
plant cell organelles, and thus this type of expression system may have advantages over other
systems. For a general review on the technology for plastid expression of exogenous peptides
in higher plants, see Hager and Beck (2000, Appl. Microbiol. Biotechnol. 54:302-310, and
references cited therein). Plastid expression has been particularly successful in tobacco (see,
for example, Staub et al., 2000, Nat. Biotechnol. 18:333-338).
F. Transgenic animals 

Introduction of a recombinant DNA into the fertilized egg of an animal (e.g., a
mammal) may be accomplished using any number of standard techniques in transgenic
animal technology. See, e.g., Hogan et al., Manipulating the Mouse Embryo: A Laboratory
Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986; and U.S.
Pat. No. 5,811,634, which is incorporated by reference herein in its entirety. Most
commonly, the recombinant DNA is introduced into the embryo by way of pronuclear
microinjection (Gordon et al., 1980, PNAS 77:7380-7384; Gordon and Ruddle, 1981, Science
214:1244-1246; Brinster et al., 1981, Cell 27:223-231; Costantini and Lacy, 1981, Nature
294:92-94). Microinjection has the advantage of being applicable to a wide variety of
species. Preimplantation embryos may also be transformed with retroviruses (Jaenisch and
Mintz, 1974, Proc. Natl. Acad. Sci. U.S.A. 71:1250-1254; Jaenisch et al., 1976, Hamatol
Bluttransfus. 19:341-356; Stuhlmann et al., 1984, Proc. Natl. Acad. Sci. U.S.A.
81:7151-7155). Retroviral mediated transformation has the advantage of adding single copies of the
recombinant nucleic acid to the cell, but it produces a high degree of mosaicism. Most
recently, embryonic stem cell-mediated techniques (Gossler et al., 1986,
Proc. Natl. Acad. Sci. U.S.A. 83:9065-9069), transfer of entire chromosomal segments
(Lavitrano et al., 1989, Cell 57:717-723), and gamete transfection in conjunction with in vitro
fertilization (Lavitrano et al., 1989, Cell 57:717-723) have also been used. Several books of
laboratory procedures have been published disclosing these techniques: Cid-Arregui and
García-Carrancá (1998, Microinjection and Transgenesis: Strategies and Protocols, Springer,
Berlin), Clarke (2002, Transgenesis Techniques: Principles and Protocols, Humana Press,
Totowa, NJ), and Pinkert (1994, Transgenic Animal Technology: A Laboratory Handbook,
Academic Press, San Diego).

Once the recombinant DNA is introduced into the egg, the egg is incubated for a short
period of time and is then transferred into a pseudopregnant animal of the same species from
which the egg was obtained (Hogan et al., supra). In the case of mammals, typically 125
eggs are injected per experiment, approximately two-thirds of which will survive the
procedure. Twenty viable eggs are transferred into a pseudopregnant mammal, four to ten of
which will develop into live progeny. Typically, 10-30% of the progeny (in the case of mice)
carry the recombinant DNA.
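
The figures quoted above can be combined into a rough estimate of how many transgenic
progeny one injection experiment might yield. The sketch below is a back-of-envelope
illustration only. It assumes, beyond what the text states, that every surviving egg is
transferred in groups of twenty; the function name and its parameters are hypothetical.

```python
def expected_transgenic_progeny(
    eggs_injected=125,            # eggs injected per experiment (from the text)
    survival_fraction=2 / 3,      # roughly two-thirds survive injection (from the text)
    eggs_per_recipient=20,        # viable eggs transferred per pseudopregnant mammal
    live_per_recipient=(4, 10),   # live progeny per recipient, low and high
    carrier_fraction=(0.10, 0.30) # fraction of progeny carrying the DNA (mice)
):
    """Return (viable_eggs, low_estimate, high_estimate) of transgenic progeny."""
    viable_eggs = eggs_injected * survival_fraction
    # Assumption of this sketch: all viable eggs are used, in groups of twenty.
    recipients = viable_eggs / eggs_per_recipient
    low = recipients * live_per_recipient[0] * carrier_fraction[0]
    high = recipients * live_per_recipient[1] * carrier_fraction[1]
    return viable_eggs, low, high


if __name__ == "__main__":
    viable, low, high = expected_transgenic_progeny()
    print(f"viable eggs: {viable:.1f}")
    print(f"transgenic progeny per experiment: {low:.1f} to {high:.1f}")
```

Under these assumptions, about 83 eggs survive injection, and an experiment would be
expected to yield on the order of 2 to 12 progeny carrying the recombinant DNA.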

While the entire animal can be used as an expression system for the peptides of the
invention, in a preferred embodiment, the exogenous peptide accumulates in products of the
animal, from which it can be harvested without injury to the animal. In preferred
embodiments, the exogenous peptide accumulates in milk, eggs, hair, blood, and urine.

If the recombinant peptide is to be accumulated in the milk of the animal, suitable
mammals are ruminants, ungulates, domesticated mammals, and dairy animals. Particularly
preferred animals are goats, sheep, camels, cows, pigs, horses, oxen, and llamas. Methods for
generating transgenic cows that accumulate a recombinant peptide in their milk are well
known: see, Newton (1999, J. Immunol. Methods 231:159-167), Ebert et al. (1991,
Biotechnology 9:835-838), and U.S. Patent Nos. 6,210,736, 5,849,992, 5,843,705,
5,827,690, 6,222,094, all of which are incorporated herein by reference in their entirety. The
generation of transgenic mammals that produce a desired recombinant peptide is
commercially available from GTC Biotherapeutics, Framingham, MA.

If the recombinant peptide is to be accumulated in eggs, suitable birds include, but are
not limited to, chickens, geese, and turkeys. Other animals of interest include, but are not
limited to, other species of avians, fish, reptiles and amphibians. The introduction of
recombinant DNA to a chicken by retroviral transformation is well known in the art:
Thoraval et al. (1995, Transgenic Research 4:369-376), Bosselman et al. (1989, Science
243:533-535), Petropoulos et al. (1992, J. Virol. 66:3391-3397), U.S. Patent No. 5,162,215,
incorporated by reference herein in its entirety. Successful transformation of chickens with
recombinant DNA has also been achieved wherein DNA is introduced into blastodermal cells
and blastodermal cells so transfected are introduced into the embryo: Brazolot et al. (1991,
Mol. Reprod. Dev. 30:304-312), Fraser et al. (1993, Int. J. Dev. Biol. 37:381-385), and
Petitte et al. (1990, Development 108:185-189). High throughput technology has been
developed to assess whether a transgenic chicken expresses the desired peptide (Harvey et al.,
2002, Poult. Sci. 81:202-212, U.S. Patent No. 6,423,488, incorporated by reference herein in
its entirety). Using retroviral transformation of chicken with a recombinant DNA, exogenous
beta-lactamase was accumulated in the egg white of the chicken (Harvey et al., 2002, Nat.
Biotechnol. 20(4):396-399). The production of chickens producing exogenous peptides in
egg is commercially available from AviGenics, Inc., Athens, GA.
G. Bacteria 

Recombinantly expressed peptides produced in bacteria are not generally
glycosylated. However, bacteria systems capable of glycosylating peptides are becoming
evident and therefore it is likely that glycosylated recombinant peptides may be produced in
bacteria in the future.

Numerous bacterial expression systems are known in the art. Preferred bacterial
species include, but are not limited to, E. coli and Bacillus species.

The expression of recombinant peptides in E. coli is well known in the art. Protocols for
E. coli-based expression systems are found in U.S. Appln. No. 20020064835, U.S. Patent Nos.
6,245,539, 5,606,031, 5,420,027, 5,151,511, and RE33,653, among others. Methods to
transform bacteria include, but are not limited to, calcium chloride (Cohen et al., 1972, Proc.
Natl. Acad. Sci. U.S.A. 69:2110-2114; Hanahan, 1983, J. Mol. Biol. 166:557-580; Mandel
and Higa, 1970, J. Mol. Biol. 53:159-162) and electroporation (Shigekawa and Dower, 1988,
Biotechniques 6:742-751), and those described in Sambrook et al., 2001 (supra). For a
review of laboratory protocols on microbial transformation and expression systems, see
Saunders and Saunders (1987, Microbial Genetics Applied to Biotechnology: Principles and
Techniques of Gene Transfer and Manipulation, Croom Helm, London), Pühler (1993,
Genetic Engineering of Microorganisms, Weinheim, New York), Lee et al. (1999, Metabolic
Engineering, Marcel Dekker, New York), Adolph (1996, Microbial Genome Methods, CRC
Press, Boca Raton), and Birren and Lai (1996, Nonmammalian Genomic Analysis: A
Practical Guide, Academic Press, San Diego).

For a general review on the literature for peptide expression in E. coli see Balbas
(2001, Mol. Biotechnol. 19:251-267). Several companies now offer bacterial strains selected
for the expression of mammalian peptides, such as the Rosetta™ strains of E. coli (Novagen,
Inc., Madison, WI; with enhanced expression of eukaryotic codons not normally used in
bacteria cells, and enhanced disulfide bond formation).
H. Cell engineering 

It will be apparent from the present disclosure that the more uniform the starting
material produced by a cell, the more efficient will be the generation in vitro of large
quantities of peptides having desired glycosylation. Thus, the genetic engineering of host
cells to produce uniformly glycosylated peptides as starting material for the in vitro
enzymatic reactions disclosed herein provides a significant advantage over using a peptide
starting material having a heterogeneous set of glycan structures attached thereto. One
preferred peptide starting material for use in the present invention is a peptide having
primarily glycan molecules which consist solely of an elemental trimannosyl core structure.
Another preferred starting material is Man3GlcNAc4. Following the remodeling process, the
preferred peptides will give rise to the greatest amount of peptides having desired
glycosylation, and thus improved clinical efficacy. However, other glycan starting material is
also suitable for use in the methods described herein, in that for example, high mannose
glycans may be easily reduced, in vitro, to elemental trimannosyl core structures using a
series of mannosidases. As described elsewhere herein, other glycan starting material may






WO 03/031464



PCT/US02/32263 



also be used, provided it is possible to cleave off all extraneous sugar moieties so that the
elemental trimannosyl core structure or Man3GlcNAc4 is generated. Thus, the purpose of
using genetically engineered cells for the production of the peptides of the present invention
is to generate peptides having as uniform as possible a glycan structure attached thereto,
wherein the glycan structure can be remodeled in vitro to generate a peptide having desired
glycosylation. This will result in a dramatic reduction in production costs of these peptides.
Since the glycopeptides produced using this methodology will predominantly have the same
N-linked glycan structure, the post-production modification protocol can be standardized and
optimized to produce a greater batch-to-batch consistency of final product. As a result, the
final completed-chain products may be less heterogeneous than those presently available.
The products will have an improved biological half-life and bioactivity as compared to the
products of the prior art. Alternatively, if desired, the invention can be used to introduce
limited and specific heterogeneity, e.g., by choosing reaction conditions that result in
differential addition of sugar moieties.

Preferably, though not as a rigid requirement, the genetically engineered cell is one
which produces peptides having glycan structures comprised primarily of an elemental
trimannosyl core structure or Man3GlcNAc4. At a minimum, the proportion of these
preferred structures produced by the genetically engineered cell must be enough to yield a
peptide having desired glycosylation following the remodeling protocol.
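
The notion of a "proportion of preferred structures" can be made concrete with a small
calculation over a glycoform profile. The sketch below is illustrative only: the glycoform
labels, the example counts, and the function name are hypothetical, and the text does not
specify any numerical threshold for what proportion is "enough".

```python
from typing import Dict

# Labels for the two preferred starting structures named in the text.
PREFERRED_STRUCTURES = {"trimannosyl_core", "Man3GlcNAc4"}


def preferred_fraction(glycoform_counts: Dict[str, int]) -> float:
    """Fraction of glycans in a profile that are preferred starting structures."""
    total = sum(glycoform_counts.values())
    if total == 0:
        raise ValueError("glycoform profile is empty")
    preferred = sum(
        count
        for name, count in glycoform_counts.items()
        if name in PREFERRED_STRUCTURES
    )
    return preferred / total


if __name__ == "__main__":
    # Hypothetical profile for an engineered cell line (counts are invented).
    profile = {"trimannosyl_core": 60, "Man3GlcNAc4": 20, "high_mannose": 20}
    print(preferred_fraction(profile))
```

A profile dominated by the two preferred structures gives a fraction close to one, which
corresponds to the more uniform starting material the text describes.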

In general, any eukaryotic cell type can be modified to become a host cell of the
present invention. First, the glycosylation pattern of both endogenous and recombinant
glycopeptides produced by the organism is determined in order to identify suitable
additions/deletions of enzymatic activities that result in the production of elemental
trimannosyl core glycopeptides or Man3GlcNAc4 glycopeptides. This will typically entail
deleting activities that use trimannosyl glycopeptides as substrates for a glycosyltransferase
reaction and inserting enzymatic activities that degrade more complex N-linked glycans to
produce shorter chains. In addition, genetically engineered cells may produce high mannose
glycans, which may be cleaved by mannosidase to produce desired starting glycan structures.
The mannosidase may be active in vivo in the cell (i.e., the cell may be genetically engineered
to produce it), or it may be used in in vitro post-production reactions.

Techniques for genetically modifying host cells to alter the glycosylation profile of
expressed peptides are well-known. See, e.g., Altmann et al. (1999, Glycoconjugate J.
16:109-123), Ailor et al. (2000, Glycobiology 10(8):837-847), Jarvis et al. (Invitrogen
Conference, March, 1999, abstract), Hollister and Jarvis (2001, Glycobiology 11(1):1-9),
Palacpac et al. (1999, PNAS USA 96:4697), Jarvis et al. (1998, Curr. Opin. Biotechnol.
9:528-533), and Gerngross (U.S. Patent Publication No. 20020137134), all of which disclose
techniques to "mammalianize" insect or plant cell expression systems by transfecting insect
or plant cells with glycosyltransferase genes.

Techniques also exist to genetically alter the glycosylation profile of peptides
expressed in E. coli. E. coli has been engineered with various glycosyltransferases from the
bacteria Neisseria meningitidis and Azorhizobium to produce oligosaccharides in vivo (Bettler
et al., 1999, Glycoconj. J. 16:205-212). E. coli which has been genetically engineered to
over-express the Neisseria meningitidis β1,3 N-acetyl glucosaminyltransferase lgtA gene will
efficiently glycosylate exogenous lactose (Priem et al., 2002, Glycobiology 12:235-240).

Fungal cells have also been genetically modified to produce exogenous
glycosyltransferases (Yoshida et al., 1999, Glycobiology 9(1):53-58; Kalsner et al., 1995,
Glycoconj. J. 12:360-370; Schwientek and Ernst, 1994, Gene 145(2):299-303; Chiba et al.,
1995, Biochem J. 308:405-409).

Thus, in one aspect, the present invention provides a cell that glycosylates a
glycopeptide population such that a proportion of glycopeptides produced thereby have an
elemental trimannosyl core or a Man3GlcNAc4 structure. Preferably, the cell produces a
peptide having a glycan structure comprised solely of an elemental trimannosyl core. At a
minimum, the proportion of peptides having an elemental trimannosyl core or a
Man3GlcNAc4 structure is enough to yield peptides having desired glycosylation following
the remodeling process. The cell has introduced into it one or more heterologous nucleic acid
expression units, each of which may comprise one or more nucleic acid sequences encoding
one or more peptides of interest. The natural form of the glycopeptide of interest may
comprise one or more complex N-linked glycans or may simply be a high mannose glycan.

The cell may be any type of cell and is preferably a eukaryotic cell. The cell may be a
mammalian cell such as human, mouse, rat, rabbit, hamster or other type of mammalian cell.
When the cell is a mammalian cell, the mammalian cell may be derived from or contained
within a non-human transgenic mammal where the cell in the mammal encodes the desired
glycopeptide and a variety of glycosylating and glycosidase enzymes as necessary for the
production of desired glycopeptide molecules. In addition, the cell may be a fungal cell,
preferably, a yeast cell, or the cell may be an insect or a plant cell. Similarly, when the cell is
a plant cell, the plant cell may be derived from or contained within a transgenic plant,
wherein the plant encodes the desired glycopeptide and a variety of glycosylating and
glycosidase enzymes as are necessary for the production of desired glycopeptide molecules.

In some embodiments the host cell may be a eukaryotic cell expressing one or more
heterologous glycosyltransferase enzymes and/or one or more heterologous glycosidase
enzymes, wherein expression of a recombinant glycopeptide in the host cell results in the
production of a recombinant glycopeptide having an elemental trimannosyl core as the
primary glycan structure attached thereto.

In some embodiments the heterologous glycosyltransferase enzyme useful in the cell
may be selected from a group consisting of any known glycosyltransferase enzyme included,
for example, in the list of Glycosyltransferase Families available in Taniguchi et al. (2002,
Handbook of Glycosyltransferases and Related Genes, Springer, New York).

In other embodiments, the heterologous glycosylase enzyme may be selected from a
group consisting of mannosidase 1, mannosidase 2, mannosidase 3, and other mannosidases,
including, but not limited to, microbial mannosidases. Additional disclosure regarding
enzymes useful in the present invention is provided elsewhere herein.

In yet other embodiments, the host cell may be a eukaryotic cell wherein one or more
endogenous glycosyltransferase enzymes and/or one or more endogenous glycosidase
enzymes have been inactivated such that expression of a recombinant glycopeptide in the
host cell results in the production of a recombinant glycopeptide having an elemental
trimannosyl core as the primary glycan structure attached thereto.

In additional embodiments, the host cell may express heterologous
glycosyltransferase enzymes and/or glycosidase enzymes while at the same time one or more
endogenous glycosyltransferase enzymes and/or glycosidase enzymes are inactivated.
Endogenous glycosyltransferase enzymes and/or glycosidase enzymes may be inactivated
using any technique known to those skilled in the art including, but not limited to, antisense
techniques and techniques involving insertion of nucleic acids into the genome of the host
cell. In some embodiments, the endogenous enzymes may be selected from a group
consisting of GnT-I, a selection of mannosidases, xylosyltransferase, core α1,3
fucosyltransferase, serine/threonine O-mannosyltransferases, and the like.

Alternatively, an expression system that naturally glycosylates peptides such that the
N-linked glycans are predominantly the trimannosyl core type, or the Man3GlcNAc4 type,
can be exploited. An example of a cell type that produces the trimannosyl core is Sf9 cells.
Other such expression systems can be identified by analyzing glycopeptides that are naturally
or recombinantly expressed in cells and selecting those which exhibit the desired
glycosylation characteristics. The invention should be construed to include any and all such
cells for the production of the peptides of the present invention.
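
The screening step described above, analyzing the glycopeptides made by candidate cells and keeping those with the desired glycosylation, can be pictured with a minimal sketch. Everything concrete here is an assumption made for illustration only: the glycoform labels, the example abundances, and the 0.5 cut-off are hypothetical and are not taken from the present disclosure, which states only that the proportion must be enough to yield the desired glycosylation after remodeling.

```python
from typing import Dict, List

# Hypothetical labels for the two target structures named in the text.
TARGET_GLYCANS = {"trimannosyl_core", "Man3GlcNAc4"}


def target_fraction(profile: Dict[str, float]) -> float:
    """Return the fraction of total glycan signal carried by the target structures."""
    total = sum(profile.values())
    if total <= 0:
        raise ValueError("glycan profile has no signal")
    on_target = sum(v for name, v in profile.items() if name in TARGET_GLYCANS)
    return on_target / total


def select_systems(profiles: Dict[str, Dict[str, float]],
                   min_fraction: float = 0.5) -> List[str]:
    """Return the names of expression systems meeting the assumed cut-off."""
    return sorted(name for name, profile in profiles.items()
                  if target_fraction(profile) >= min_fraction)


# Illustrative, made-up relative abundances for two hypothetical cell lines.
example_profiles = {
    "cell_line_A": {"trimannosyl_core": 70.0, "high_mannose": 20.0, "complex": 10.0},
    "cell_line_B": {"trimannosyl_core": 5.0, "high_mannose": 15.0, "complex": 80.0},
}

print(select_systems(example_profiles))
```

In practice the cut-off would be chosen from the yield required after the remodeling process rather than fixed in advance.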

V. Purification of glycan remodeled and/or glycoconjugated peptides

If the modified glycoprotein is produced intracellularly or secreted, as a first step, the
particulate debris, either host cells or lysed fragments, is removed, for example, by
centrifugation or ultrafiltration; optionally, the protein may be concentrated with a
commercially available protein concentration filter, followed by separating the peptide
variant from other impurities by one or more steps selected from immunoaffinity
chromatography, ion-exchange column fractionation (e.g., on diethylaminoethyl (DEAE) or
matrices containing carboxymethyl or sulfopropyl groups), chromatography on Blue-Sepharose,
CM Blue-Sepharose, MONO-Q, MONO-S, lentil lectin-Sepharose, WGA-Sepharose, Con
A-Sepharose, Ether Toyopearl, Butyl Toyopearl, Phenyl Toyopearl, or protein A Sepharose,
SDS-PAGE chromatography, silica chromatography, chromatofocusing, reverse phase HPLC
(RP-HPLC), gel filtration using, e.g., Sephadex molecular sieve or size-exclusion
chromatography, chromatography on columns that selectively bind the peptide, and ethanol,
pH or ammonium sulfate precipitation, membrane filtration and various techniques.

Modified peptides produced in culture are usually isolated by initial extraction from
cells, enzymes, etc., followed by one or more concentration, salting-out, aqueous ion-exchange,
or size-exclusion chromatography steps. Additionally, the modified glycoprotein
may be purified by affinity chromatography. Then, HPLC may be employed for final
purification steps.

A protease inhibitor, e.g., phenylmethylsulfonylfluoride (PMSF), may be included in
any of the foregoing steps to inhibit proteolysis, and antibiotics may be included to prevent
the growth of adventitious contaminants.

Within another embodiment, supernatants from systems which produce the modified
peptide of the invention are first concentrated using a commercially available protein
concentration filter, for example, an Amicon or Millipore Pellicon ultrafiltration unit.
Following the concentration step, the concentrate may be applied to a suitable purification
matrix. For example, a suitable affinity matrix may comprise a ligand for the peptide, a lectin
or antibody molecule bound to a suitable support. Alternatively, an anion-exchange resin
may be employed, for example, a matrix or substrate having pendant DEAE groups. Suitable
matrices include acrylamide, agarose, dextran, cellulose, or other types commonly employed
in protein purification. Alternatively, a cation-exchange step may be employed. Suitable
cation exchangers include various insoluble matrices comprising sulfopropyl or
carboxymethyl groups. Sulfopropyl groups are particularly preferred.

Then, one or more RP-HPLC steps employing hydrophobic RP-HPLC media, e.g.,
silica gel having pendant methyl or other aliphatic groups, may be employed to further purify
a peptide variant composition. Some or all of the foregoing purification steps, in various
combinations, can also be employed to provide a homogeneous modified glycoprotein.

The modified peptide of the invention resulting from a large-scale fermentation may
be purified by methods analogous to those disclosed by Urdal et al., J. Chromatog. 296: 171
(1984). This reference describes two sequential RP-HPLC steps for purification of
recombinant human IL-2 on a preparative HPLC column. Alternatively, techniques such as
affinity chromatography may be utilized to purify the modified glycoprotein.

VI. Preferred Peptides and Nucleic Acids Encoding Preferred Peptides

The present invention includes isolated nucleic acids encoding various peptides and
proteins, and similar molecules or fragments thereof. Such peptides include, but are not
limited to, human granulocyte colony stimulating factor (G-CSF), human interferon alpha
(IFN-alpha), human interferon beta (IFN-beta), human Factor VII (Factor VII), human Factor
IX (Factor IX), human follicle stimulating hormone (FSH), human erythropoietin (EPO),
human granulocyte/macrophage colony stimulating factor (GM-CSF), human interferon
gamma (IFN-gamma), human alpha-1-protease inhibitor (also known as alpha-1-antitrypsin
or alpha-1-trypsin inhibitor; A-1-PI), glucocerebrosidase, human tissue-type activator (TPA),
human interleukin-2 (IL-2), human Factor VIII (Factor VIII), a 75 kDa tumor necrosis factor
receptor fused to a human IgG immunoglobulin Fc portion, commercially known as
ENBREL™ or ETANERCEPT™ (chimeric TNFR), human urokinase (urokinase), a Fab
fragment of the human/mouse chimeric monoclonal antibody that specifically binds
glycoprotein IIb/IIIa and the vitronectin alpha-v beta-3 receptor, known commercially as
REOPRO™ or ABCIXIMAB (chimeric anti-glycoprotein IIb/IIIa), a mouse/human chimeric
monoclonal antibody that specifically binds human HER2, known commercially as
HERCEPTIN™ (chimeric anti-HER2), a human/mouse chimeric antibody that specifically
binds the A antigenic site or the F protein of respiratory syncytial virus, commercially known
as SYNAGIS™ or PALIVIZUMAB (chimeric anti-RSV), a chimeric human/mouse
monoclonal antibody that specifically binds CD20 on human B-cells, known commercially as
RITUXAN™ or RITUXIMAB (chimeric anti-CD20), human recombinant DNase (DNase),
a chimeric human/mouse monoclonal antibody that specifically binds human tumor necrosis
factor, known commercially as REMICADE™ or INFLIXIMAB (chimeric anti-TNF),
human insulin, the surface antigen of a hepatitis B virus (adw subtype; HBsAg), and human
growth hormone (HGH), and the like.

The isolated nucleic acid of the invention should be construed to include an RNA or a
DNA sequence encoding any of the above-identified peptides of the invention, and any
modified forms thereof, including chemical modifications of the DNA or RNA which render
the nucleotide sequence more stable when it is cell free or when it is associated with a cell.
As a non-limiting example, oligonucleotides which contain at least one phosphorothioate
modification are known to confer upon the oligonucleotide enhanced resistance to nucleases.

Specific examples of modified oligonucleotides include those which contain
phosphorothioate, phosphotriester, methyl phosphonate, short chain alkyl or cycloalkyl
intersugar linkages, or short chain heteroatomic or heterocyclic intersugar ("backbone")
linkages. In addition, oligonucleotides having morpholino backbone structures (U.S. Patent
No. 5,034,506) or polyamide backbone structures (Nielsen et al., 1991, Science 254: 1497)
may also be used.

Chemical modifications of nucleotides may also be used to enhance the efficiency
with which a nucleotide sequence is taken up by a cell or the efficiency with which it is
expressed in a cell. Any and all combinations of modifications of the nucleotide sequences
are contemplated in the present invention.

The present invention should not be construed as being limited solely to the nucleic
and amino acid sequences disclosed herein. As described in more detail elsewhere herein, 
once armed with the present invention, it is readily apparent to one skilled in the art that other 
nucleic acids encoding the peptides of the present invention can be obtained by following the 
procedures described herein (e.g., site-directed mutagenesis, frame shift mutations, and the 
like), and procedures that are well-known in the art. 

Also included are isolated nucleic acids encoding fragments of peptides, wherein the
peptide fragments retain the desired biological activity of the peptide. In addition, although
exemplary nucleic acids encoding preferred peptides are disclosed herein in relation to
specific SEQ ID NOS, the invention should in no way be construed to be limited to any
specific nucleic acid disclosed herein. Rather, the invention should be construed to include
any and all nucleic acid molecules having a sufficient percent identity with the sequences
disclosed herein such that these nucleic acids also encode a peptide having the desired
biological activity disclosed herein. Also contemplated are isolated nucleic acids that are
shorter than full length nucleic acids, wherein the biological activity of the peptide encoded
thereby is retained. Methods to determine the percent identity between one nucleic acid and
another are disclosed elsewhere herein, as are assays for the determination of the biological
activity of any specific preferred peptide.
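
To make the notion of percent identity concrete, the following is a minimal sketch that scores two sequences that have already been aligned to equal length. It is an illustration under stated assumptions and is not the method disclosed elsewhere herein: the function name is hypothetical, gaps are assumed to be written as "-", and columns containing a gap are assumed to be excluded from the denominator (other conventions count them).

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the ungapped columns of two pre-aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # assumption: gap columns are not counted
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Illustrative sequences only; they are not SEQ ID NOS of the disclosure.
print(percent_identity("ATGCCGTA", "ATGACGTA"))
```

With the two eight-base example sequences, which differ at one position, the function reports seven matches out of eight compared columns.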

Also as disclosed elsewhere herein, any other number of procedures may be used for
the generation of derivative, mutant, or variant forms of the peptides of the present invention
using recombinant DNA methodology well known in the art such as, for example, that
described in Sambrook et al. (1989, Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory Press, New York) and Ausubel et al. (1997, Current Protocols in
Molecular Biology, Green & Wiley, New York). Procedures for the introduction of amino
acid changes in a peptide or polypeptide by altering the DNA sequence encoding the peptide
are well known in the art and are also described in Sambrook et al. (1989, supra) and Ausubel
et al. (1997, supra).

The invention includes a nucleic acid encoding a G-CSF, IFN-alpha, IFN-beta, Factor
VII, Factor IX, FSH, EPO, GM-CSF, IFN-gamma, A-1-PI, glucocerebrosidase, TPA, IL-2,
Factor VIII, chimeric TNFR, urokinase, chimeric anti-glycoprotein IIb/IIIa, chimeric
anti-HER2, chimeric anti-RSV, chimeric anti-CD20, DNase, chimeric anti-TNF, human insulin,
HBsAg, and HGH, wherein a nucleic acid encoding a tag peptide is covalently linked thereto.
That is, the invention encompasses a chimeric nucleic acid wherein the nucleic acid sequence
encoding a tag peptide is covalently linked to the nucleic acid encoding a peptide of the
present invention. Such tag peptides are well known in the art and include, for instance,
green fluorescent protein (GFP), myc, myc-pyruvate kinase (myc-PK), His6, maltose binding
protein (MBP), an influenza virus hemagglutinin tag polypeptide, a flag tag polypeptide
(FLAG), and a glutathione-S-transferase (GST) tag polypeptide. However, the invention
should in no way be construed to be limited to the nucleic acids encoding the above-listed tag
peptides. Rather, any nucleic acid sequence encoding a peptide which may function in a
manner substantially similar to these tag peptides should be construed to be included in the
present invention.

The nucleic acid comprising a nucleic acid encoding a tag peptide can be used to
localize a peptide of the present invention within a cell, a tissue, and/or a whole organism
(e.g., a mammalian embryo), detect a peptide of the present invention secreted from a cell,
and to study the role(s) of the peptide in a cell. Further, addition of a tag peptide facilitates
isolation and purification of the "tagged" peptide such that the peptides of the invention can
be produced and purified readily.

The invention includes the following preferred isolated peptides: G-CSF, IFN-alpha,
IFN-beta, Factor VII, Factor IX, FSH, EPO, GM-CSF, IFN-gamma, A-1-PI,
glucocerebrosidase, TPA, IL-2, Factor VIII, chimeric TNFR, urokinase, chimeric
anti-glycoprotein IIb/IIIa, chimeric anti-HER2, chimeric anti-RSV, chimeric anti-CD20, DNase,
chimeric anti-TNF, human insulin, HBsAg, and HGH.

The present invention should also be construed to encompass "derivatives,"
"mutants", and "variants" of the peptides of the invention (or of the DNA encoding the same)
which derivatives, mutants, and variants are peptides which are altered in one or more amino
acids (or, when referring to the nucleotide sequence encoding the same, are altered in one or
more base pairs) such that the resulting peptide (or DNA) is not identical to the sequences
recited herein, but has the same biological property as the peptides disclosed herein, in that
the peptide has biological/biochemical properties of G-CSF, IFN-alpha, IFN-beta, Factor VII,
Factor IX, FSH, EPO, GM-CSF, IFN-gamma, A-1-PI, glucocerebrosidase, TPA, IL-2, Factor
VIII, chimeric TNFR, urokinase, chimeric anti-glycoprotein IIb/IIIa, chimeric anti-HER2,
chimeric anti-RSV, chimeric anti-CD20, DNase, chimeric anti-TNF, human insulin, HBsAg,
and HGH.

Further included are fragments of peptides that retain the desired biological activity of
the peptide irrespective of the length of the peptide. It is well within the skill of the artisan to
isolate smaller than full length forms of any of the peptides useful in the invention, and to
determine, using the assays provided herein, which isolated fragments retain a desired
biological activity and are therefore useful peptides in the invention.

A biological property of a protein of the present invention should be construed to
include, but not be limited to, the ability of the peptide to function in the biological
assays and environments described herein, such as reduction of inflammation, elicitation of an
immune response, blood-clotting, increased hematopoietic output, protease inhibition,
immune system modulation, binding an antigen, growth, alleviation or treatment of a disease,
DNA cleavage, and the like.

A. G-CSF 

The present invention encompasses a method for the modification of the glycan
structure on G-CSF. G-CSF is well known in the art as a cytokine produced by activated
T-cells, macrophages, endothelial cells, and stromal fibroblasts. G-CSF primarily acts on the
bone marrow to increase the production of inflammatory leukocytes, and further functions as
an endocrine hormone to initiate the replenishment of neutrophils consumed during
inflammatory functions. G-CSF also has clinical applications in bone marrow replacement
following chemotherapy.

While G-CSF has been shown to be an important and useful compound for
therapeutic applications in mammals, especially humans, present methods for the production
of G-CSF from recombinant cells result in a product having a relatively short biological life,
an inaccurate glycosylation pattern that could potentially lead to immunogenicity, loss of
function, and an increased need for both larger and more frequent doses in order to achieve
the same effect, and the like.

G-CSF has been isolated and cloned, the nucleic acid and amino acid sequences of
which are presented as SEQ ID NO:1 and SEQ ID NO:2, respectively (Figure 52A and 52B,
respectively). The present invention encompasses a method for modifying G-CSF,
particularly as it relates to the ability of G-CSF to function as a potent and functional
biological molecule. The skilled artisan, when equipped with the present disclosure and the
teachings herein, will readily understand that the present invention provides compositions and
methods for the modification of G-CSF.

The present invention further encompasses G-CSF variants, as well known in the art.
As an example, but in no way meant to be limiting to the present invention, a G-CSF variant
has been described in U.S. Patent No. 6,166,183, in which a G-CSF comprising the natural
complement of lysine residues and further bound to one or two polyethylene glycol
molecules is described. Additionally, U.S. Patent Nos. 6,004,548, 5,580,755, 5,582,823, and
5,676,941 describe a G-CSF variant in which one or more of the cysteine residues at position
17, 36, 42, 64, and 74 are replaced by alanine or alternatively serine. U.S. Patent No.
5,416,195 describes a G-CSF molecule in which the cysteine at position 17, the aspartic acid
at position 27, and the serines at positions 65 and 66 are substituted with serine, serine,
proline, and proline, respectively. Other variants are well known in the art, and are described
in, for example, U.S. Patent No. 5,399,345.

The expression and activity of a modified G-CSF molecule of the present invention
can be assayed using methods well known in the art, and as described in, for example, U.S.
Patent No. 4,810,643. As an example, activity can be measured using radio-labeled
thymidine uptake assays. Briefly, human bone marrow from healthy donors is subjected to a
density cut with Ficoll-Hypaque (1.077 g/ml, Pharmacia, Piscataway, NJ) and low density
cells are suspended in Iscove's medium (GIBCO, La Jolla, CA) containing 10% fetal bovine
serum, glutamine and antibiotics. About 2 x 10^4 human bone marrow cells are incubated
with either control medium or the G-CSF of the present invention in 96-well flat bottom
plates at about 37°C in 5% CO2 in air for about 2 days. Cultures are then pulsed for about 4
hours with 0.5 µCi/well of 3H-thymidine (New England Nuclear, Boston, Mass.) and uptake
is measured as described in, for example, Venuta et al. (1983, Blood 61:781). An increase in
3H-thymidine incorporation into human bone marrow cells as compared to bone marrow cells
treated with a control compound is an indication of an active and viable G-CSF compound.
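
The comparison at the end of the assay, incorporation in treated wells relative to control wells, can be sketched as a simple ratio of mean counts. This is an illustrative calculation only: the function name, the optional background subtraction, and the example counts per minute are assumptions introduced here, and the disclosure does not specify how large an increase indicates activity.

```python
from statistics import mean
from typing import Sequence


def fold_incorporation(treated_cpm: Sequence[float],
                       control_cpm: Sequence[float],
                       background_cpm: float = 0.0) -> float:
    """Ratio of mean background-corrected counts in treated vs. control wells."""
    treated = mean(treated_cpm) - background_cpm
    control = mean(control_cpm) - background_cpm
    if control <= 0:
        raise ValueError("control signal must exceed background")
    return treated / control


# Made-up replicate counts per minute for illustration only.
treated_wells = [1200, 1300, 1250]
control_wells = [400, 420, 380]

ratio = fold_incorporation(treated_wells, control_wells)
print(f"fold increase over control: {ratio:.3f}")
```

A ratio above 1 corresponds to the increase in incorporation described above; replicate variability would normally be assessed before drawing a conclusion.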

B. IFN alpha and IFN beta
The present invention further encompasses a method for the remodeling and
modification of IFN alpha and IFN beta. IFN alpha is part of a family of approximately
twenty peptides of approximately 18 kDa in weight. IFN alpha and IFN beta, collectively
known as the Type I interferons, bind to the same cellular receptor and elicit similar
responses. Type I IFNs inhibit viral replication, increase the lytic potential of NK cells,
modulate MHC molecule expression, and inhibit cellular proliferation, among other things.
Type I IFN has been used as a therapy for viral infections, particularly hepatitis viruses, and
as a therapy for multiple sclerosis.

Current compositions of Type I IFN are, as described above, useful compounds for
both the modulation of aberrant immunological responses and as a therapy for a variety of
diseases. However, they are hampered by decreased potency and function, and a limited
half-life in the body as compared to natural cytokines comprising the natural complement of
glycosylation.

The prototype nucleotide and amino acid sequence for IFN alpha is set forth herein as
SEQ ID NO:3 and SEQ ID NO:4, respectively (Figure 53A and 53B, respectively). IFN beta
comprises a single gene product of approximately 20 kDa, the nucleic acid and amino acid
sequence of which are presented herein as SEQ ID NO:5 and SEQ ID NO:6 (Figure 54A and
54B, respectively). The present invention is not limited to the nucleotide and amino acid
sequences herein. One of skill in the art will readily appreciate that many variants of IFN
alpha exist both naturally and as engineered derivatives. Similarly, IFN beta has been
modified in attempts to achieve a more beneficial therapeutic profile. Examples of modified
Type I IFNs are well known in the art (see Table 8), and are described in, for example, U.S.
Patent No. 6,323,006, in which cysteine-60 is substituted for tyrosine, and U.S. Patent Nos.
4,737,462, 4,588,585, 5,545,723, and 6,127,332, where an IFN beta with a substitution of a
variety of amino acids is described. Additionally, U.S. Patent Nos. 4,966,843, 5,376,567, and
5,795,779 describe IFN alpha-61 and IFN-alpha-76. U.S. Patent Nos. 4,748,233 and
4,695,543 describe IFN alpha gx-1, whereas U.S. Patent No. 4,975,276 describes IFN
alpha-54. In addition, U.S. Patent Nos. 4,695,623, 4,897,471, 5,661,009, and 5,541,293 all
describe a consensus IFN alpha sequence to represent all variants known at the date of filing.
While this list of Type I IFNs and variants thereof is in no way meant to be exhaustive, one of
skill in the art will readily understand that the present invention encompasses IFN beta and
IFN alpha molecules, derivatives, and variants known or to be discovered in the future.

Table 8. Interferon-α Isoforms.

α type    AA characteristic
1a        A114
1b        V114
2a        K23-H34
2b        R23-H34
2c        R23-R34
4a        A51-E114
4b        T51-V114
7a        M132-K159-G161
7b        M132-Q159-R161
7c
8a        V98-L99-C100-D101-R161
8b        S98-C99-V100-M101-R161
8c        S98-C99-V100-M101-D161Δ(162-166)
10a       S8-L89
10b       T8-I89
14a       F152-Q159-R161
14b       F152-K159-G161
14c       V152-Q159-R161
17a
17b       H34-S55-I161
17c       H34-S55-R161
17d       H34-P55-R161
21a
21b       L96



Methods of expressing IFN in recombinant cells are well known in the art, and expression is
easily accomplished using techniques described in, for example, U.S. Patent No. 4,966,843,
and in Sambrook et al. (2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory Press, New York) and Ausubel et al. (1997, Current Protocols in Molecular
Biology, Green & Wiley, New York).